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TOWARDS THE AUTOMATIC GENERATION OF PROGRAMMED FOREIGN- LANGUAGE 
INSTRUCTIONAL MATERIALS* 

Joseph A. Van Campen 
Institute for Mathematical Studies 
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Stanford, California 9^305 



Introduction . 

An attempt to provide a thorough- going answer to the question of 
the extent to which the utilization of a computational system can facilitate 
the generation of programmed materials for the teaching of foreign languages 
would involve a consideration of so many elusive variables as to 
necessitate €i prolonged research effort by a number of specialists in such 
diverse areas, as computer hardware, systems programming, compilers and 
interpreters, peripheral devices, foreign language teaching, and programmed 
instruction. The purpose of this report is a much more modest one--to 
describe a set of programs, written for and successfully implemented on the 
PDP-10 computing system of the Institute for Mathematical Studies in the 
Social Sciences at Stanford University, which either perform certain tasks 
useful in the generation of programmed foreign-language instructional 
material or facilitate the writing of such task-oriented programs by olhe" 
researchers . 

Since ro further Federal funds have been requested by the principal 
investigator for research in this area, each program dealt with below will 
be presentee, insofar as possible, on its own terms, without overly great 
emphasis on the advantages which might flow from its integration with 
programs not as yet implemented, (The ties between two or more existing 
programs are, of course, pointed out in the introduction to the sections 
dealing with each of the individual programs.) 

1, Program for the Cycled Selection of Individual Vocabulary Items 
Belongirg to Certain Semantic Classes. 

1.0 Purpose and Justification. 

It would appear that a certain economy in the generation of foreign- 
language instructional material could be achieved by taking advantage of 
the fact that in the case of vast numbers of concrete utterances one or 
more of the vocabulary items which make up the utterance can be replaced 
by any one of a (in the case of elementary language courses almost always 
fairly limited) set of other vocabulary items without destroying the 
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acceptability of the utterance. Thus, for instance, in the English 
utterance the docto r Is vritlng a book , the word doctor can be replaced 
by any member of a set including such words as lawyer , man , teacher , 
author , and girl * The word book could be replaced by any member of a set 
which includes the words poem , story , novel , letter , and speech * It would 
appear that if we labelled the first set ’non-infant humans' and the 
second ’written object’ vt could rewrite the utterance as the ( non- inf ant 
huma n) is writing a (written object) , in which the set labels are set, off 
from concrete words by parentheses. Assuming that the words listed above 
exhaust the membership of the sets in question (a situation quite possible 
in the early stages of a first-year language course), ve see that the 
rewritten utterance could itself be rewritten in twenty- five different 
ways by replacing both of the set labels by different members of the sets 
in question. 

The above example is, of course, an extremely crude one. On the one 
hand it takes no account of the additional acceptable utterances which 
rnigh* be produced by allowing some variation in the grammatical categories 
expressed in the original utterance (e.g, the doctor has written a book , 
the doctors are writing a book , etc.). On the other hand, it ignores 
troublesome cases of the type the doctor is w riting the prescription (in 
which the set of items which can replaced doctor is very small) and such 
complications as the difference in set membership needed to account for 
the doctor J £ r eading ( not writing ) newspaper . 

However, there is as yet no reason to believe that, within the 
framework of a given language course, the use of set labels would not 
result in considerable economies in a large number of cases. The fact 
that such economies might, not. stand in any direct relationship to those 
attainable on the basis of a thorough analysis of the total semantic 
structure of a given language need not concern us here. 

Given the desirabi 1 it.y of the replacement of some or all of th^ 
constituents of utterance types by set labels, there can be no doubt 
that it would be useful to develop a program capable of replacing any 
given set label by an individual member of the set in question. Furthermore, 
if would also be desirable to have l) some record of the relative 
frequency of usage of the various members of the se‘ , and f ) a mean- of 
insuring that this frequency would remain relatively uniform for all 
members of 4 he set. 

1 . 1 Document a* ion 

1.11 Function of *he Program 

For the above mentioned ends the principal investigator wrote and 
implemented a PDP-lO assembly language program which examines a string containing 
one or more ret labels and replaces those labels it encounters with 
wails belonging to the sets in question. In addition, this program examines 
the usage index accompanying c -,ch member of the given set and selects the 
member to be used in a give"- *ase from the subset with the lowest index. 

Finally, the usage index of 4 he member selected is increased by one, unless 

O 




this would result in an index greater than the upper limit (decimal 9) • 

In the latter case, the index of the member selected is set at one and 
those of the other set members are reset to zero, 

1.12 File Format. 

1.121 Input Files. 

The input for the program consists of l) a disk file (channel l) 
containing one or more strings including set labels to be replaced by 
set members, and 2) a disk file (channel 2) containing an alphabetized list 
of set labels each accompanied by the members of the given set. 

1.121L String Format. 

Each string on channel 1 must include l) one or more set labels, 
each of which must be enclosed in parentheses, and 2) a slash, not 
enclosed in parentheses, indicating the end of the given string. The 
string may include other items not enclosed in parentheses, e.g., 
punctuation marks and concrete language material. The only restriction 
on such items is that they not include any of the following: an opening 

or closing parenthesis, a slash, a plus mark (see 1.12111), a comma 
(see 2.2322), a divide sign (see 2.2412), or either of the symbols < and 
> (see 1.1222). Except for the terminating slash, which is emitted from 
output strings, material not enclosed in parentheses is simply transferred 
from the input to the output file without affecting the flow of the program 
in any other way. 

1.12111 Format of Set Labels in Strings. 

As was pointed out in the preceding paragraph, set labels within 
input strings are regularly enclosed in parentheses, However, it may 
often prove useful for other purposes to include with a given set label 
additional information concerning the item in question, such as its 
syntactic role in the given string or possible restrictions on the set 
of grammatical categories (e.g. number, tense) it may exhibit . In order 
to permit the convenient notation of such additional information it was 
decided to utilize a plus sign placed immediately after the final 
character of a given set label in place of the closing parenthesis. The 
latter is used to terminate the information accompanying the label. 

Thus, material accompanying a set label may include any characters other 
than an opening or closing parenthesis, a slash or a plus sign. 

1.12112 Examples of String Input. 

1.121121 String Containing only Set Labels Unaccompanied by Additional 
Informal ion. 

(person) (emotion verb) (person)/ 

1.121122 Same String as 1.21121 with Punctuation Mark (Period). 

(person) (emotion verb) (perron)./ 
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1.121123 Same String as 1.21122 with Concrete Language Ivems. 

(person) does rot- (emotion verb) (person)./ 

1.121124 Same String as 1.21123 with Additional Syntactic Informal ion. 
(person + subject) does not (emotion verb) (person + direct object)./ 

1.121123 Same String as 1.2112^ with Additional Information on Category 
Restrictions. 

(person + subject singular) does not (emotion verb) 

(person + direct object plural)./ 

Note that only the items in 1.121121 are significant for the program 
in question. The other examples merely illustrate the manner in which 
additional material, presumably of use in connection with other programs, 
can be included along with the significant items. 

1.1212 Set-List Format. 

Each of the alphabetically ordered set labels on channel 2 is 
immediately followed by the members of the set in question. 

1.12121 Set-Label Format. 

Each set label is enclosed in parentheses. Labels may include any 
character except a parenthesis, slash, or plus mark. In contrast to 
1.2111, no additional information of any kind is permitted within the 
paren + heses enclosing the label. 

1.12122 Set-Member Format. 

1.121221 Set-Member Delimiters. 

Each member of a set is set off by l) a preceding less-than-or- equal 
sign (<), and ?.) a following great er-than-or-equal sign ( >) . 

Aside from the usage index discussed in 1.121222 below, the characters 
occurring between these two delimiters are never examined by the program 
and may include, in addition to the basic form of a given vocabulary item, 
additional coded information cn the Mem in quest ion, such as its inflectional 
or syntactic peculiarit ics. While it is obvious that c?rtain conventions 
would have to be established in order to insure the proper interpretation 
of such Informal iur. by programs concerned with generating inflected forms 
or establishing the appropriate categories to be assigned to words governed 
by or modifying the word in question, such convention? are irrelevant to 
4 he operation of the program under consideration. 

1.121222 Usage Index. 

Immcdiat ely after the initial < there is a single decimal digit 
indicating the relative frequency of usage for the wore in question. This 
digit, which can range from to 8, is increased by one each time the given 
word is selected for use in a concrete sentence. Tr.e jrogran rejects for 




use in any given case a set member with a usage index higher than that of 
any other member of the set, thus insuring a relatively uniform frequency 
of use for all members of the set. (When the usage index of each member 
of the set reaches eight, all of the indices in question are reset to zero.) 

1.12123 Examples of Set-List Format. 

1.121231 Set Labels. 

( person) 

(square object) 

(emotion verb) 

1.21232 Set Members (with zero usage index). 

< 0 man > 

< 0 cube > 

< 0 like > 

1.21233 Set Labels followed by Set Members (with zero usage index). 

( person) 

< 0 man > 

< 0 woman > 

< 0 boy > ~ 

( square object) 

< 0 cube > 

< 0 square > 

(emotion verb) 

< 0 like > 

< 0 hate > 

< 0 despise > 

< 0 adore > 

1.122 Output Files. 

The output of the program consts of l) a disk file (channel 3) 

consisting of one or more strings containing members of the sets specified 
in the input strings discussed under 1.1211 above, and 2) a disk file 
(channel 4) containing the updated version (i.e., the version with usage 
indices reflecting the utilization of set members employed in the output 
strings) of the set list described u.tder 1.1212 above. 

1.221 Output String Format. 

Each set label of the input string has been replaced by a member of 
the set in question. Where the set label of the input string was 
accompanied by additional information of the type described in 1.12111, 
such additional information is placed immediately before the closing 
delimiter of the set member. The usage index of the set member is omitted 
in the output string. 
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1.2211 Examples of String Output with Corresponding Input. 

1.22111 Input Set Labels Lacking Additional Information* 

Input: (person) (emotion verb) (person) . / 

Output: < man > < like > < woman >. 

1.22112 Input Set Labels Including Additional Information. 

Input: (person + definite) (emotion verb + 3* s. pres.) 

(person + definite)./ 

Output: < man + definite > < like + 3* s. pres. > 

< woman + definite > . 

1.222 Updated- Set- List Format. 

The format of the updated set list is identical with that of the set- 
list inpui file described in 1.1212. Only the usage indices of set members 
utilized in the output strings described under 1.221 differ from the 
corresponding items on the input file. 

1.2221 Example of Updated Set List Corresponding to Input Set 

List Given Under 1.21233 after Generation of Output String 
Given Under 1.22111. 

( person) 

< 1 man > 

< 1 woman > 

< 0 boy > 

(suuate object) 

< 0 cube > 

< 0 square > 

(emotion verb) 

< 1 like > 

< 0 hate > 

< 0 d-spise > 

< 0 adore 2* 

1,2 Evaluation and Prospects for Future Development . 

The program in question performs a useful, if sonevhat trivial, 
function. It could be made more valuable by the addition of two features: 
reiterative string generation and optional non- increment at ion of the 
usage ina^x of a given set member. The latter feature would allow the 
repeated utilization of one and the same set member within a single output 
string (as in, for example, this man likes Mary and that man hates her ) . 

The former would allow the generation of more than one output string for 
a single specified input string, pcrsumably by means of a decimal digit 
placed after the final slash of the string. Thus, using the sets given 
under 1 - 21233 , the input string 
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(person) (emotion verb) (person) . / 3 would give 

< man > 5 - 5 voman 2" 

< boy > < hate > < man > 

< voman > < despise > < boy > 

Whether or not reiterative string generation is implemented, it would 
be extremely desirable to have some means of avoiding undesirable limitations 
on the make-up of output strings resulted from random coincidences in the 
number of members constituting individual sets (e.g., the limitation of a 
group of ten output strings from sets A, B, and C, each with three members 
to the types A 1 B 1 C 1 , A 2 B 2 C 2 , and A^ C^) . Where the output strings 
are all generated from a single input specification this problem can be 
solved simply by comparing each new output string with those previously 
generated and replacing one or more members until either a new string is 
formed or all possible combinations are exhausted. In other cases one 
might insure that no two sets would have the same number of members by 
adding dummy members with an “illegal" usage index at the end of some of 
the sets. (Whenever such a dummy was encountered the first member of 
the set could be utilized, but only the illegal index usage of the dummy 
member would be raised. By keeping the illegal indices in a fixed relation 
to those of the real members of the set it would be possible to include more 
than one dummy member in a given set.) 

2. Coding System for Concise Formulation of a Relatively Large 
Number of Semantic-Syntactic Patterns. 

2.0 Justification and Purpose. 

As was pointed out in 1.1 above, the utilization of input- strings 
containing one or more set labels corresponding to groups of words sharing 
certain semantic features may enable us to generate a large number of 
concrete sentences from a single specified input string. Such input 
strings will henceforth be referred to as semantic- syntactic patterns, 
since, as we have seen in 1.12111, they may include as much information 
as necessary concerning the syntactic roles played by the members of the 
string. 

The question arises whether it might be possible to attain certain 
economies in the specification of the semantic- synt act ic patterns 
themselves by developing a coding system which would permit two or more 
semantic-syntactic patterns including one or more common set labels to 
be included within a single 1 master pattern. 1 From the point of view of an 
individual generating the input for a language course, such a formulation 
would presumably be more economical than the separate specification of each 
of the semant ic-synt act ic patterns in question. It vould, of course, 
necessitate the development of some procedure for the retrieval of 
individual patterns from a master pattern, a procedure discussed in 
section 3 below. 

The exact extent to which the use of master patterns would prove more 
economical than the specif icat ion of all individual patterns is difficult 
to predict. However, it would appear that in certain cases considerable 



gains could be made. One of these cases is that of a noun which can occur 
with only one of a set of mutually exclusive modifiers, e.g. the noun 
house, which can be modified by the definite article, the indefinite 
article, a demonstrative adjective or a possessive adjective (e.g. the house , 
a house, this house , my house), but not, at least in normal usage, by two 
or more of these at the same time (e.g. *the a house , *a my house , * this 
the house , etc.) 

Another case occurs when a word dependent on another word may, but 
need not, itself be modified by yet another word , e.g., good books , very 
good books, write letters , write snort letter s, write very short letters . 

While it may be that a more sophisticated approach to input coding 
will account for such cases by the use of general algorithms based on 
considerations of semantic and/or syntactic compatibility, it would not be 
unreasonable to provide a more immediate means of reducing the redundancies 
which might arise from an approach limited to the specification of 
individual patterns. 

2.1 General Characteristics of a Desirable Coding System. 

A coding system for master patterns should make a clear distinction 
between items which must be present in any and all the individual patterns 
general able from the master pattern and those which are absent from one 
or more of the individual patterns. Within the latter group h. rr'. t 
distinguish, on the one hand, between items which can cooccur and items 
which are mutually exclusive, and, on the other, between items which can 
occur independently of any other item and items which are dependent on 
another item, i.e., which can occur in an individuLl pattern only if another 
item is present. Finally, the coding system must provide for the above 
distinctions with a maximum of simplicity so that the specification of 
master pattern does not involve significantly greater effort and 
possibility for error than the specification of an individual pattern 
including approximately the same nur.be r of characters. 

2.2 Introductory Comments on The Proposed Coding System. 

2.21 Relation of the Proposed Coding System to the Format of individual 
Semantic -Syntactic Patterns . 

Since individual semantic- syntactic patterns must contain at least 
one or more set labels delimited by parentheses (cf. 1.1211) it would 
seem reasonable to utilize parentheses as the fundamental delimiter in 
master patterns as veil. On the other hand, since the plus sign can occur 
within individual set labels (cf. 1.12111), it. cannot conveniently be 
used as a set-label delimiter in a master pattern. The slash used to 
terminate individual syntactic-semantic patterns can be retained to 
signal the end of a master pattern. 

2.22 Relation of *he Proposed Coding System to Boolean Algebra. 

The proposed coding system, while utilizing somewhat different symbols, 
was strongly influenced by Boolean algebra in l) the utilization of 
parentheses, and ?.) the expression of conjunction and disjunction. Since 
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it may be reasonably assumed that a large percentage of those reading 
this report vill be familiar with the Boolean notation, it would seem 
that a discussion of the features of the proposed coding system can best 
be carried out by contrasting these features with their Boolean 
counterparts . 

2.25 Conjunction. 

2.231 Use of Parentheses to Express Subordinat ive Conjunction. 

As was pointed out in 2.21, the utilization of parentheses in a 
delimitative function allows us to retain in master patterns symbols 
employed as delimiters in concrete semantic- syntactic patterns. In Boolean 
algebra the parentheses are utilized for determining the order of operations. 
In the proposed coding system this function has been essentially retained 
(for illustrations see 2. 52 and 2.53 below). In addition, however, 
parentheses are employed in noting a subordinat ive conjunctive relationship 
between a pair of set labels, i.e., a relationship in which one of the 
conjoined set labels is grammat ically dependent on (e.g. in agreement 
with, a modifier of, governed by) the other. This is accomplished by 
placing the dependent (i.e. governed, agreeing, or modifying) seu label, 
together with its enclosing parentheses within the parentheses which enclose 
the non-dependent member of the pair. Thus, for example, the set-label 
pair (noun subject (verb predicate)) indicates that the predicate depends 
on (agrees with) the subject. 

This additional function results in a much more extensive utilization 
of parentheses in our notation than in Boolean algebra, since there are 
numerous cases which necessitate the noting of subordinat ive conjunction 
between members of a set-label pair, even though the order of operations 
is irrelevant. 

Thus, in Boolean algebra (A(B(C))) is merely an extremely uneconomical 
notation of (ABC) or simply ABC. In our notation, since it includes no 
indication of non-subordinative conjunction or of disjunction (see 2.232 
and 2.24 below), (ABC) represents simply a single set label (recall that 
no set label may occur without parentheses), while (A(b(c))) represents 
three conjoined set labels such that C is dependent on (modifies or is 
governed by) B and B is dependent on A. The sequence (AB(c)) represents 
two conjoined set labels, (AB) and (C), the latter modifying or being 
governed by the former, while (A(BC)) shows the same relationships between 
the set labels (A) and (EC). 

2.2311 The Ordering of Dependent Set Labels. 

In order to simplify the operation of the program discussed in 
section 3 below, a dependent set label cannot precede the set label on 
which it is dependent. There is, for example, no such pattern as ((A)BC) 
or ((AB)C). The notation for such cases must be { BC { A ) ) and ( C ( AB) ) , 
respect ively. 



It follows that our dependency notation may entail significant 
differences between the order of set labels included in a master pattern 
and the order of concrete words occurring in sentences generated i rom 
individual semant ic-synt act ic patterns derived from the master pattern. 
Thus, a string of the type p oor men might correspond to a master pattern 
including the set -label sequence^ person (wealth adjective)). Provisions 
for converting dependency-oriented sequences to the order actually 
employed in utterance strings are discussed in section 3 below. 

2.2312 Examples of the Use of Parentheses to Express Subordinat ive 
Conjunct ion. 

2.23121 Master- Pattern Notation Matches Utterance Order. 

Master-Pattern Novation: (person (emotion verb( person) )). / 

Utterance; John likes Mary. 

2.23122 Master- Pattern Notation Differs from Utterance Order. 

Mast er- Pattern Notation: (writing verb{vriting object ( duration 

adject ive (adjectival intensifier) ) ) ) . / 
Utterance: Write very short stories. 

(The master-pattern orucr (A(B(C{D)))) corresponds to the 
utterance order ADOB. ) 

2.232 Use of the Comma and Parentheses to Express Non-Subordl native 
Conjunct ion. 

2.2321 Definition of Non- Subordinat ive Conjunction. 

It is possible that two set labels which are themselves not members 
of the same dependency pair (i.e., do not have a subordinat ive 
conjunctive relationship with one another) may nevertheless play identical 
(dependent or, less frequently, non- dependent) roles in separate 
subordinat ive-conjunct ive relationships with one and the same set label. 
From the point of view of grammatical analysis, such set -label pairs 
fall into two categories: those which are clearly cases of coordinative 

conjunction (e.g., big , black c louds ) and those in which one of tv'o 
dependent set labels can be viewed as dependent not on a single non- 
dependent set label but on the dependency pair formed by the non-dependent 
set label with the other dependent set label. Thus, for instance, in the 
phrase t h i s old book the demonstrative adjective can be viewed as 
modifying the phrase old took . Again, in he rarel y writes letters 
the adverb may be viewed as dependent on the phrase writes letters . 

However, it would appear that little or nothing is gained by 
providirig separate notations for these two types of sequences. In all 
the cases which have come to the attention of the principal investigator 
thus far, there is no evidence that items which could be viewed as 
dependent on a dependency pair as a whole behave differently from items 
dependent only on the non- dependent member of such a pair. Thus, the 
agreement of the demonstrative in such phrases as this old bock follows 
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the r;ame rules as in phrases of the type this book . Again the adverb in 
he rarely writes letters is subject to the same rules as the one in he 
rarely writes . 

For this reason it would seem in the interests of notational economy 
to view both of the cases discussed above as examples of a single 
phenomenon--non- subordinat ive conjunction, i.e. a conjunctive relationship 
between two set labels not entailing the dependency of one of them on the 
other, Accordingly, phrases of the type this old book will receive the 
cine notational treatment as those of the type old , tattered books . 

2.2J22 Notational Devices. 

2.23221 Non-subordinative Conjunction of Two Set Labels Dependent on 
Third Set Label. 

Since by definition set labels which stand in a non-subordinative 
conjunctive relationship to one another also play identical roles in 
separate subordinat ive conjunctive relationships to a third set label, 
it follows that if the non- subordinat ive ly conjoined set labels are 
dependent on the third set label it would be possible simply to include 
both of the former within the parentheses enclosing the latter. For 
example, (A(B)(c)) would represent the non- subordinat ively conjoined pair 
of set labels (B) and (c), each of which is dependent on the third set 
label (A). However, it would appear that not only notational economy, 
but also graphic clarity v-ould be served by introducing into our system 
a new delimi* at ional symbol which would permit us to include 
non- subordinat ively conjoined set labels within a single set of parentheses, 
Since the dot and the multiplication sign, commonly employed to denote 
'onjunction in Boolean algebra, are easily confused with the period and 
ihe letter x, respectively, it was decided to utilize the comma for this 
purpose. Thus, in place of (.<(B)(c)) we may write (A(B,C)), 

2.23222 Non-Subordinative Conjunction of Two Set Labels with 
Dependent Third Set Label. 

It might seem at first glance that the notation of cases of this 
type (e.g. expensive hats and shoes , in which the adjective applies to 
both nouns) could follow the pattern established in 2 . 23221 , i.e., that 
the non- subordinately conjoined set labels could be separated by a comma 
and the dependent third set label included within the parentheses 
surrounding the non- dependent pair. Thus, (A,B(C)) would represent the 
non- subordinat ively conjoined set labels (A) and (B) and the third set 
label {c) which depends on both (A) and (b). 

Unfortunately, the utilization of this notation for cases of this 
type is rendered less than desirable by the need for a convenient 
representation of such phrases as this very old book , in which, from the 
point of view of our notational system, the adjective old participates 
in three relationships: 1) non-subordinative conjunctive with this , 

2) subordina f ive conjunctive with the non-dependent item book , and 

3) subordinat ive conjunctive with the dependent item very . In accordance 
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with 2.2J221, the master pattern for phrases of this type could include 
( noun{ demonstrative adjective, age adjective)). However, it would appear 
that the most convenient and graphically clear notation of the set label 
for the adjectival intensifier modifying old would be its inclusion between 
the set label corresponding to that adjective and the first closing 
parentheses, i.e., ( noun( demonstrative adjective, age adjective ( ad jectival 
intensifier) ) ) . 

It would therefore appear that, unless ve wish to modify the notation 
adopted in 2.23221, it would be best to devise another notation for phrases 
of the type expensive hats and shoe s . It would seem that this could be 
accomplished with maximal clarity by l) placing the dependent third set 
label outside the parentheses surrounding the non-subordinat ively conjoined 
pair and 2) indicating its dependence on the preceding pair by enclosing 
it within an extra set of parentheses. Thus, instead of (A,B(c))--a 
notation reserved for cases in which (C) is dependent only on (B)--ve would 
write (A, B) ( (C) ) . 

2.23223 Non-Subordinat ive Conjunction of More Than Two Set Labels. 

It is quite possible that the non-subordinat ive conjunctive relation- 
ships discussed in 2.23221 and 2.23222 may hold among more than two set 
labels. Thus, instead of this old book or old , t altered books ve might 
have this old, tattered book , with three non-subordinatively conjoined 
items dependent on book . On the other hand, in addition to phrases such 
as expensive hats and shoes ve can expect expensive hats, shoes and gloves , 
in which expe nsive is dependent on three conjoined nouns. 

The notation of such cases can be adequately handled simply by 
extending the techniques discussed in 2.23221 and 2.23222 to provide for 
the insertion of a delimiting comma after every non-subordinatively 
conjoined set label except the last. Thus, the first series cited In 
the preceding paragraph would correspond to the notation (A(B,C,D)), 
while the second would be written as (A, B,C)( (D) ) . 

2. 2*23 Examples of Non-Subordinat ive Conjunction. 

2.23231 Two Non-Subordinatively Conjoined Set Labels. 

2.232311 Non-Dependent Third Set Label. 

Master- Fat ♦ ci n Not at ion: ( noun (domonst rat ive pronoun, age adjective))/ 

Utterance: this old book 

Martci -Fa* + crn Notation: (r,cur.(a£c adjective, condition adjective))/ 

Utterance: old, tattered looks 

2.232312 Dependent Third Set Label. 

Master-Pattern Notation: (purchase verb, sale verb) { (object noun))/ 

Utterance: buy and sell books 

Master- Fa* 4 ^rn Not at ion: (male adult, female adult )( (vealt h adjective))/ 

Utterance: rich rxn and women 
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2.2323? More than Two Noo-Subordinatively Conjoined Set Labels. 

2.232321 Non- Dependent Subordi natively Conjoined Set Label, 

Master Pattern: (noun{ demonstrative pronoun, age adjective, 

condition adjective))/ 

Utterance: this old, tapered book 

2.232322 Dependent Subordinatively Conjoined Set Label. 

Master Pattern: (purchase verb, sale verb, exchange verb) ( (object 

noun) )/ 

Utterance: buy, sell, and trade books 

2.24 Disjunction. 

2.241 Primary Role of Exclusive Disjunction. 

Since, as was pointed out in 2.0 above, one of the strongest 
Justifications for the creation of a master-pattern notation lies in the 
fact that certain words can occur with any one of a set of mutually 
exclusive modifiers, it is not unreasonable to attend first to the 
notation of the "exclusive or" relationship. 

2. 2411 Definition of Exclusive Disjunction. 

For our purposes an exclusive disjunctive relationship can be said 
to occur between two or more set labels in a given master pattern when 
l) for any concrete semantic- synt act ic pattern derivable from the given 
master pattern only one of the set labels can be present, and 2) any one 
^ of the set labels, if it is indeed present in a concrete semantic-syntactic 
pattern, will play one and the same role in a dependency pair with one 
and the same subordinat ively conjoined set label. Thus, for instance 
phrases of the type my book and this book could be derived from a single 
master- pat tern formulation in which the set labels corresponding to my and 
this are in exclusive disjunction. 

2.2412 Notation. 

2.24121 Exclusive Disjunction between Two Set Labels. 

Since ve have already employed the plus sign within set- labels 
(cf. 1.1211) it cannot conveniently be employed to indicate a disjunctive 
relationship between two set labels. Since, however, it seems desirable 
to utilize a symbol not normally employed in the strings constituting 
such labels, and because of the associative ties between disjunction, 
separation and division, it was decided to employ the divide sign (1) to 
separate the members of an exclusive disjunctive pair. The subord i native 
conjunctive relationship of each member of the pair to a third set label 
in the master pattern is indicated by the devices discussed in 2.23221 
and 2.23222 above. Thus, (A 4 B) indicates that the set labels (A) and (B) 
are in exclusive disjunction; (C(A 4 B)) indicates that whichever one of 
them is present in a given string will depend on (C)j while (A 4 B)((c)) 
indicates tha* (C) will depend on whichever one of (A) or ( B) occurs in a 
given string. 
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2.2bl22 Exclusive Disjunction between More Than Two Set Labels* 

As in the case of non- subordi native conjunction, we simply modify the 
notation developed for pairs of set labels by writing a delimiter r.fter 
each set label except the last. Thus, with a non-dependent subordi natively 
conjoined set label we write (A(B 4 C * D)), while with a dependent one 
we write (A 4 B 4 C)((D)). 

2.2*413 Example of Exclusive Disjunction. 

Master Pattern: (noun( demonstrative adjective 4 possessive 

adjective t definite article)) 

2.24131 Concrete Patterns. 

2.2*41311 (noun{ demonstrative adjective)) 

2.2*41312 {noun(possessive adjective)) 

2 . 2*41313 ( noun( definite article)) 

2.2*4132 Utterances . 

2.2*41321 Corresponding to 2.2*41311. 
this book 

2.2*41322 Corresponding to 2.2*41312. 
my book 

2.2*41323 Corresponding to 2.241313. 
the book 

2.2*42 Inclusive Disjunction. 

2.2421 Definition of Inclusive Disjunction. 

For our purposes an inclusive disjunctive relationship can be said 
to occur between two or more set labels in a given master pattern when 
l) any combinat ion of the set labels in quest ion can occur within a 
concrete semant ic - synt act ic pattern derived from the given master pattern 
ar,dj2) each of the set labels which does occur in a concrete semantic- 
syntactic pattern will play one and the same role in a dependency pair 
with one and the same subordi natively conjoined set label. Thus, If within 
a single master pattern, in addition to phrases of the type, old , 
t altered books , which were cited in conjunction with non- subordi nat ive 
conjunction (of. 2.23221), we wish to make provision for phrases such as 
old books and t altered book s , ve need simply change the relationship 
between the set labels (age adjective) and (condition adjective) to one 
of inclusive disjunction. 
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2.U22 Notation. 



There would appear to be no strong reasons for introducing a new 
symbol to represent the inclusive disjunctive relationship. First of 
all, as is pointed out- in 2.631232, this type of relationship is not 
likely to occur with any great frequency in the material with which 
we are concerned. Secondly, it is adequately covered by the notational 
devices utilized to indicate restrictions on the omission of optional 
dependent set labels (cf. 2.631232 and 2.632232). 

2.5 The Ordering of Operations. 

2.51 Relationship of Ordering to Subordinative Conjunction. 

In order to permit the derivation of an optimally large number 
of concrete semantic- synt act ic patterns from a single master pattern, 
it is necessary to allow for a wide variety of disjunctive and 
conjunctive relationship ’'networks’ 1 involving a large number of set 
labels. This in turn necessitates a set of rules governing the order 
in which we will perform the selection or grouping of set labels in 
deriving concrete patterns. Thus, if we are faced with a master 
pattern of the type (A 4 B(C * D)), we must be able to decide whether 
one of the selective operations indicated by the two disjunction 
symbols depends on the results of the other. A similar question must 
be asked about the selection and grouping operations indicated by the 
divide sign and the comma in (A 4 B(C,D)). 

As was pointed out in 2.231; a subordinative conjunctive 
relationship between two set labels indicates that one of them is 
grammatically dependent on the other. In addition, if the non- 
dependent label or labels with which a given dependent label is 
subordinat ively conjoined is (are) not non-subordinat ively conjoined 
with a dummy label (cf. 2.63 and 2.7 below), the dependent label 
can occur only in those concrete semant ic- synt act ic patterns which 
include (one of) the non- dependent label(s). 

It follows that subordinative conjunction can serve as a guide 
to the ordering of operations, since operations affecting the dependent 
set label of a given dependency pair need be perfoimed only after the 
completion of these operations which determine the presence or absence 
in a given concrete pattern of the set latcl(s) v T ith which the 
dependent label must te conjoined. 
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Parentheses Depth as a Guide to the Ordering of Operations 

Since in noting subordinat ive conjunction ve place the dependent 
set label(s) either within the parentheses enclosing the non-dependent 
label(s) or (cf. 2.23222) within an extra set of parentheses, it follows 
that ve can order our operations in the manner described in 2.51 simply 
by performing first those operations enclosed by the smallest number of 
sets of parentheses. 

Operations entailing a greater number of sets o^ parentheses would 
be performed only if they involved set labels which were subordinat ively 
conjoined with one or more set labels selected from the master pattern 
as a result of the performance of preceding operations. 

Thus, for example in (A 4 B(C)), ve would first choose between (A) 
and (b). Only if (B) were chosen would ve proceed to the selection of 
(C), since (cf. 2.23222) this label is not subordinat ively conjoined with 
(A). Similar considerations would apply to (A * B(C * D) ) or (A 4 B(C,D)). 
On the other hand, in (A 4 B)((c)), ?C) would always be selected, since it 
depends on either (A.) or (B). 

2-521 The Utilization of Additional Sets of Parentheses. 

The ordering of operations in accordance with the hierarchy of 
subordinat ive conjunction does not permit us to impose different orders 
of grouping and selection on a number of set labels all of which play one 
and the same role in subordinat ive conjunction with one and the same set 
label. Thus, we have so far no way of deciding whether (A(B,C + D,E)) 
will result in the concrete patterns (a(B,D,E)) and (A(B,C,E)) or in 
(A( BC) ) and (A(DE)). 

It would appear that the simplest solution to this problem is the 
introduction into our notation of additional sets of parentheses not 
needed for the representation of subordinat ive conjunction. Thus, if 
ve wish to insure the derivation of the first pair of concrete patterns 
mentioned in the preceding paragraph, ve need only writ ? ( A( B, ( C + D) ,E) ) , 
while the derivation of 'he second pair is assured by + (D,E))). 

Note that the additional parentheses do not replace any of the ether 
operational symbols. This feature, which leads to the- use of what may 
appear *0 be superfluous sets of parentheses in certain cases (cf. the 
second notation above), was introduced to simplify the urogram discussed 
in sec* ion 3 below. 

2.53 Examples of the Derivation of Concrete Scmant D -Synt act ic 
Fat terns from Master Fatforsn in which the Ordering of 
Operations is Significant. 

2.531 Additional Parentheses Not Required. 

Master Pattern: (proper name 4 kinship noun(propo.* name 4 possessive 

adject ive) )( (oral noise verb))/ 
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2.533,1 Concrete Pattern Resulting from Selection of Set Label 
( Proper Name) . 

(proper name (oral noise verb))/ 

2.5312 Concrete Patterns Resulting from Selection of Set Label 
(Kinship Noun)/ 

2.53121 (kinship nour( proper name, oral noise verb))/ 

2.53122 (kinship noun( possessive adjective, oral noise verb)) 

2.5313 Utterance Corresponding to 2.5311* 

John is singing. 

2.53141 Utterance Corresponding to 2.53121. 

Mary's brother is talking. 

2.53142 Utterance Corresponding to 2.53122. 

Their nephew is crying. 

2.532 Additional Parentheses Required. 

Master Pattern: { noun( ( possessive adjective + demonstrative 

adjective), age adjective))/ 



2.5321 


Concrete Patterns. 




2.53211 


( noun( possess ive adjective, age 


adjective) ) / 


2.53212 


( noun( demonstrative adjective, 


age adjective))/ 


2.5322 


Utterances . 




2.53221 


Corresponding to 2.53211. 





my old house 

2.53222 Corresponding to 2.53212* 
this new table 

2.6 Optional Relationships. 

2.6l Definition of Optional Relationships and Optional Set Labels. 

As was indicated in 2.0, there are numerous cases in which a set 
label may, but need not, be subord inat ively conjoined with a dependent 
set label. In such ca^es the subordinat ive conjunctive relationship can 
be referred to as an optional relationship and the dependent set label as' 
an optional set label. Thus, in a master pattern notation ( nounfqualit y 
ad Ject ive(adject ival ini ens if ier) ) ) , corresponding to phrases of the type 
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very good books , there is one optional relation ship- -(quality adjective 
(adjectival intensif ier) ) --and one optional set label-- (adject ival 
intensif ter) . 

2,6 2 The Need for a Rotational Device. 

It wight at first appear that there is little or no need to provide 
a separate notation for optional relationships and/or set labels. 

Indeed, in the case just cited and in many others it would be possible 
to account for such concrete patterns lack! rig the optional items-- in 
this case the pattern (noun(quality adject ive) )- -by a general algorithm 
based on the rule for the ordering of operations. Thus, the dependent 
set label of a subordi natively conjoined pair might be regarded as an 
optional set label and the subordi native conjunction viewed as an optional 
rela T ionship whenever the non-dependent set label also occurred as the 
dependent member in a subordinat ive conjunctive relationship with a 
third set label. 

There are, however, a number of cases in which such an approach 
would fail. Thus, iri representing prepositional phrases modifying a 
verb ve employ the pattern ( verb( preposit ion( noun) ) ) . In many cases the 
omission of the set label (noun) will result in a concrete pattern 
leading to the generation of unacceptable utterances. Compare for 
instance he reads in bed and he reads in . Again, certain transitive 
verbs (e.g, pulverize , compress ) hardly ever occur without a direct 
object in normal speech, while others (e.g. read , write ) do so quite 
frequently. It would, therefore, appear that the development of a 
separate notation for optional relationships and/or labels is both 
necessary and useful . 

2.6} The Utilization of the Dummy Label to Indicate 

Optional Relationships, 

2.6}1 Rationale. 

Since, on the one hand, it would seem desirable to keep the set of 
symbols r*o 4 permit- ed within set labels as small as possible, and since, 
on the other, it would seem less than desirable to develop new notat ional 
devices for optional varian* s of each of the three types of relationships 
discussed above, it was decided to signal the presence of an opt ional 
relationship by placing the op* ional set label in exclusive disjunction 
with a 11 dummy" or ’zero' set label consisting of the single digit 0. 

Note that * ho utilization of the dummy label (0) does not force us to 
exclude the character 0 from other se* labels. The only restriction it 
erg ails is S ' exclusion of set labels consisting solely of the 
character 0. 

2.6311 Optional Relationship Affect ing Only Gn^ Dependent Set Label. 

This tyf -» is. ext remely simple, requiring only the insertion of the 
divide sign avl the dummy label after * h* optional label. Thus, while 
(A(E(C))) includes no op* ional rela* ionship, in (A(B(C * 0))) the 
relationship (P(o)) and the set label (C) are optional. Again, in 
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(A 4 B)((C)) the concrete patterns are limited to (A(C)) and (B(C)), 
while in (A 4 B)((C 4 0)) they include (A(c)), ( B( C ) ) , (A), and (B). 

2.6312 Optional Relationship Affecting Two or More Dependent Set Labels. 

2.63121 Dependent Set Labels in Exclusive Disjunction. 

This type is also quite simple: since only one of the dependent 

set labels participating in such a relationship can be present in any 
concrete semant ic- synt act ic pattern, we can make the relationship 
optional by the same means as in 2.6311* Thus, non-optional {A(B 4 C)), 
which allows only the concrete patterns (A(B)) and (A(C)), corresponds 
to optional (A(B 4 C 4 0)), which permits (A) as well. 

2.63122 Dependent Set Labels in Sj.bordi native Conjunction. 

This case too is taken care of simply by inserting the exclusive 
disjunction symbol and the dummy label. Thus, non-optional (A(B(C))), 
which generates only one concrete pattern, corresponds to optional 
(A(B(C) 4 0)), whicn permits both (A(B(c))) and (A). 

2.63123 Dependent Set Labels in Non-Subordinat ive Conjunction. 

This case is somewhat more complicated than the preceding ones, 
since it requires the utilization of additional sets of parentheses to 
define the order of operations (cf. 2.&21). 

2.631231 Unrestricted Omission of Optional Labels. 

This type entails no complications other than the use of additional 
sets of parentheses. Thus, if the non-optional notation is (A(B, C,D)), 
which generates only one concrete pattern, the notation (A(B,C,(D 4 0))) 
permits both (A(B,C,D)) and (A{B,C))j (A(B,(C 4 0),D)) allows (A(B,C,D)) 
and (A(B,D)); (A(B, (C 4 0^{D 4 0))) gives all of the above plus (A(B)); 
{A((B ♦ 0),C,D)) generates (A(B,C,D)) or (A(C,D)); etc., etc. The 
maximum number of concrete patterns is generated by (A((B 4 0),(C 4 0), 

(D ♦ 0)))) which permits any combination of dependent labels with the 
label (A), as veil as a concrete pattern consisting of (A) alone. 

2.6312*2 Restricted Omission of Optional Labels. 

It may prove desirable in some cases to insure that all the 
concrete patterns derived from a given master will include at least one 
(or, less probably, more than one) member of a group of non-subordinat ively 
conjoined optional set labels. Thus, one might wish to permit phrases 
of the type he speaks Russian veil , he squeaks veil , and he speaks Russian , 
but to exclude phrases such as he speaks . While there is some doubt as 
to whether restrictions of this type are likely to be utilized very 
often, they can be adequately conveyed by a combination of optional non- 
eubordinat ive conjunction and non-optional exclusive disjunction. 

Thus, if we wish *0 generate the concrete patterns ( A( B, C) ) > ( A( B) ) and 
(A(c)), but to exclude (A), we need write not ( A{ { B 4 0),(C 4 0))), but 
(A((B,(C « 0))4C}). 
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Insole:: as only one of the optional dependent set labels need be 
present in any concrete pattern, the restricted omission of optioral set 
labels is equivalent to an inclusive disjunctive relationship am on 3 them. 
As was pointed out in 2 .422, this fact, combined with the relative 
infrequency of tdiis relationship, obviates the necessity for a special 
notational device representing inclusive disjunction as such. 

2.632 Examples of the Derivation of Concrete Semantic-Syntactic 

Patterns from Master Patterns including Optional Relationships. 

2.6321 Only One Dependent Set Label. 

Master Pattern: (person( reading verb( reading object + $) ) ) . / 

2.63211 Concrete Pattern including Optional Relationship. 

( per son( reading verb( reading object)))./ 

2.63212 Concrete Pattern without Optional Relationship. 

(person (reading verb))./ 

2.63213 Utterance Corresponding to P.o'P'n. 

John is reading a book. 

2.63214 Utterance Corresponding to 2.63212. 

John is reading. 

2.6322 More than One Dependent Set Label. 

2.63221 Dependent Set Labels in Exclusive Disjunction. 

Master Pattern: (noun(demonstrat ive adjective i possessive 

adjective * /))/ 

2.632211 Concrete Patterns with Optional Relationship. 

2 . 6 322 111 ( noun ( dr irons * rat i ve ad Je ct ivr ) ) / 

2.6322 112 { noun( possessiv? adjective))/ 

2.632212 Concrete Pattern without Optional Relationship. 

( noun)/ 

2.632213 Utterances. 

2 . 6j221 3 1 Co r re s \ c nd i ng to 2.6 322 1 L 1 . 
this ha 4 

2.6322132 Corresponding to 2.6322112. 

my hat 
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2. 6522^:53 Corresponding to 2.632212. 
hat 



2.63222 Dependent Set Labels in Subordinative Conjunction. 

Master Pattern: (noun(age adjective(adjectival intensifier) + 0))/ 

2.632221 Concrete Patterns. 

2.6322211 (noun(age adjective( adjectival intensifier)))/ 

2.6322212 (noun)/ 

2.632222 Utterances. 

2.6322221 Corresponding to 2.6322211. 

very old hats 

2.6322222 Corresponding to 2.6322212. 
hat s 



2.63223 Dependent Set Labels in Non-Subordinative Conjunction. 

2.632231 Unrestricted Omission of Dependent Set Labels. 

Master Pattern: (person(read verb((read object *t 0),( speed adverb * 

0 ))))./ 



2.6322511 

2.63223m 

2.65223112 

2.65223115 

2.6522511^ 

2.6322312 

2.63223121 

John 



Concrete Patterns. 

(porson(read verb(read object, speed adverb)))./ 
(person(read verb(read object)))./ 

( per ?on( read verb (speed adverb)))./ 

(per son (read verb))./ 

Utterances . 

Corresponding to 2.63223111* 
reads books quickly. 



2.63223122 Corresponding to 2.63223112. 
John reals looks. 

2.63223123 Corresponding to 2.63223113. 
John reais quickly. 




23 



2.632 22512k Corresponding to 2.632225114. 

John roads. 

2.6322.52 Restricted Omission of Dependent Sot Labels 
(Inclusive Disjunction). 

Master Pattern: (person(read verb((read object , (speed adverb < ^)) < 

speed adverb)))./ 

The concrce patterns and utterances are identical with those in 
2.6J223111 through 2.63225115 &nd 2.63223121 through 2.63225123, 
respect ively. 

2.7 Evaluation and Prospects for Future Development. 

Apart from the largo number of sets of parentheses needed to insure 
the proper ordering of operations (cl . 2 . 521 ) and the complexity of the 
notational devices for inclusive disjunction (cf. 2 . 631252 ), our 
notation makes no provision for -he optional no 1 at ion of the non-dependent 
set label in a subordinat ively conjoined pair. Since in patterns almost 
all labels which play a non-dependent role in one subordinat ive conjunctive 
relationship also play a dependent roJe in another such relationship, 
only such set labels us those playing the role of the subject or, in the 
case of subject less imperative sentences, the verb, are not covered by 
our notation. 

There is, however, no reason why we could not cover these cases as 
well, simply by placing the non-dependent set label in exclusive disjunction 
vi + h the dummy label. Thus, if Vo wish to allow tor both the concrete 
pattern (A(R)) and ( B ) , we could write (A * fj)((B)). Such a modification 
would require' a corresponding modified ion of the program discussed in 
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3.112 The Representation of Conjunction in Concrete Patterns. 

The program discussed in section 1 operates on set labels each of 
which is enclosed by a single set of parentheses. It follows that it 
cannot operate on concrete patterns which utilize the devices discussed 
under 2.231 and 2.232. On the other hand, the information on the 
subordinat ive and non- subordinat ive conjunction of set labels which must 
be included in master patterns cannot be simply omitted in generating concret 
patterns, since it provides a major part of the information on government, 
agreement and modification needed to generate the conect forms of 
individual words selected by the program discussed in section 1. Since 
that program permits the inclusion of additional information relevant to 
a given set label after a plus sign following the string of characters 
constituting the label itself and preceding the closing parentheses for 
that label, it would appear that a second function of the program under 
discussion must be the conversion of the master-pattern representation 
of conjunctive relationships to a form compatible with the input format 
specified in 1.1211. 

3.12 Pile Format. 

3.121 Input Files. 

The input for the program consists of a disk file (channel l) 
containing one or more master patterns with the format specified in section 
2. For examples of master pattern input see 2.2312, 2.2323; 2.2413; 

2.53; etc. (Note that there is no provision in the existing master 
pattern notation for tbs inclusion of literals, i.e. concrete words. At 
present the only way to Include a literal in a master pattern notation is 
to represent it as a set label corresponding to a set with a single 
member. ) 

3.122 Out pvt Files. 

3.1221 Equivalence of Concrete Patterns and Input Strings Discussed 
in 1.1211. 

The output of the program consists of a disk file (channel 3) 
identical in formal to that of the input file on channel 1 ment ioned in 
1.121. Each concrete pattern of our output is identical with one of the 
input strings discussed under 1.1211. 

3.1222 The Representation of Subordinat ive and Non- Subordinat ive 
Conjunction. 

As was indicated in 3*1 12, information on conjunctive relationships 
between set labels is coded as additional information accompanying 
individual set labels in accordance with 1.12111, This information takes 
the form of two alphabetic characters immediately following the plus sign 
and themselves followed by a space. The space, which serves only to set 
off the two characters from any other additional information which may 
precede the closing parenthesis for the given set, label, may be omitted 
if there is no additional informat ion. The first of the two characters 
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serves to identify the set label in question, while the second identifies 
a non-dependent set label with which the set label is subordinatively 
conjoined, Thus, the coding (x + ba) indicates that set label (x) is 
represented by the character b and is a dependent set label subordinatively 
conjoined with the non-dependent set label represented by the character 
a. Again (y + ca) states that set label (y) is represented by the 
character c and also depends on the set label represented by the character 
a. 



Whether or not (x) and (y) are coordinat ively conjoined can be 
decided only after a consideration of such factors as their semantic 
compatibility and, in some cases, additional coding. However, it is 
clear that the fact of their non-subordinat ive coordinat ion--the only 
information conveyed by our master-pattern not at ion- -can be established 
simply by ascertaining that they both depend on the set label represented 
by the character a. 

If a set label does not depend on any other set label, the second 
character after the plus is identical with the first. Thus, the sequence 
(z + aa) indicates that the set label (z) is represented by the character 
a and is not a dependent set label in any subordinative conjunctive 
relationship. It would appear that, subject to certain limitations 
connected with clause boundaries, representations such as (x + aa) and 
(y + bb) would indicate that (x) and {y) are non- subordinat ively conjoined 



3.1223 Examples of Output with Corresponding Input and Sample Utterances 



5.12231 Input with No Optional Relationships. 

Input: (person(read verb + write verb(vrite object)))./ 

3.122311 Output. 

3 . 1223 m (person 4 aa)(rcad verb + ba)./ 

3.1223112 (person 4 aa)(vrite verb + ca)(vritc object + dc)./ 

3.122312 Utterances, 

3.1225121 Corresponding to 3.1223111. 

John is reading. 



3.1223122 Corresponding to 3.122311 2. 

John is writing a letter. 

3.12252 Input with Optional Relationships. 

Input; ( pe rson( read verb((read object 4 0),{ speed adverb 4 #))))./ 



3.122521 Output . 



3.1223211 

3.1223212 

3.1223213 

3.1223214 

3.122322 

3.1223221 

John 

3* 1223222 
John 

3.1223223 
John 

3.1223224 
John 



(person + aa)(read verb + ba)(read object + cb)( speed adverb 
+ 4b)./ 

(person + aa)(read verb + ba)(read object + cb)./ 

(person + aa)(read verb + ba)(speed adverb + 4b)./ 

(person + aa)(read verb + ba),/ 

Utterances . 

Corresponding to 3*1223211. 
reads books quickly. 

Corresponding to 3*1223212, 
reads books. 

Corresponding to 3*1223213. 
reads quickly. 

Corresponding to 3*1223214. 
reads . 



3.2 Evaluation and Prospects for Future Development. 

Insofar as the notation discussed in section 2 is a satisfactory 
one, the program under consideration would appear to operate quite 
satisfactorily: it converts master patterns to concrete patterns without 

losing any information on subordinat ive and non- subordi native coordination, 
However, as was pointed out in 2.2311| the sequence of set labels in a 
master pattern and, consequently, in concrete patterns as veil, may 
often differ from the order in which individual words corresponding to 
such labels would occur in normal utterances. It would appear therefore, 
that the concrete patterns produced by the program documented under 3*1 
should be utilized not as input for the program documented under 1,1, 
but as input for a " re-order ing M program, which would have as its output 
concrete patterns identical to those of the input in every respect except 
the sequencing of set labels. The latter would correspond to the normal 
word order of utterances 10 be generated from the queen concrete pattern. 



While it is impossible to give here a detailed outline of the 
operation of such a program, it is clear that it would entail the use 
of transformations based on l) the information on subordinat ive and non- 
subordinat ive conjunction placed after the plus sign accompanying each 
sec label in a concrete pattern, and 2) other information on the semantic 
and/or syntactic properties of members of individual sets. Thus, if in a 
concrete pattern for English utterances l) (x) depends on (y), and 2 ) (x) 
is an adjectival intensifier while (y) is an adjective, the concrete pattern 
rotation (y f aa)(x f ba) would be converted to (x + ba)(y + aa) to insure 
the correct order of such phrases as ve ry good , ext remely bad , etc. 
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I , Program for the Automatic Listing of Coded Vocabulary Items as 

Members cf Semantic Sets. 

4.0 Justification and Purpose. 

The preparation of a set list of the type discussed in 1.1212 
necessitates the association of individual vocabulary items with the set 
labels referring to the semantic sets of which a given vocabulary item 
is a member. However, once this association has been established, there 
is no reason why the actual inclusion of the vocabulary item in the list 
of set members corresponding to each relevant set label cannot be 
accomplished by a computer program. It would appear that such a program 
would significantly reduce the lime and effort required to create and 
update the set list. 

4.1 Documentation. 



4.11 Function of the Program. 

The program operates on l) an alphabetized list of vocabulary Hems 
each of which is followed by the set labels corresponding to the sets of 
which if is a member, and 2) a set list of the type discussed in 1.1212. 
Each vocabulary item is listed among the set members corresponding to each 
of the set labels with which it is associated. If a given set label is 
no* found on the set iis*, it is entered at the appropriate alphabetic 
position in *he list with th^ new vocabulary item(s) as its member(s). 

4 . r i le Format . 

4 . i?l Input Files. 

Tne input for the program consists of l) a disk file (channel l) 
ccntalnlrg a list of new vocabulary items with their associated set labels, 
and ?) a disk file (channel 2) identical in format with the set list 
discussed under 1.12i2. 

4.^211 Vo- a du i ary List Forma*. 

Each vocabulary Hem is immediately preceded by the symbol < and 
immediately followed by * hp symbol >. The set labels associated with the 
vocabulary Hem follow 4 i.e symbol J>. Each set label is enclosed in 
parer. 1 he.: es . riot icr.s cr. :e* - label format are 4 he same as those 

discussed in 1 12121- 



12111 Example 

< boy j 

5 6 ° 2 * 

< house > 

<. red > 



cf Vocabulary .‘. 1 st Format . 

( animat e t^ing ) ( Juveni le) (male) ( person) 
(mo f i o 1 1 vf rb) 

( 1 u i 1 d 1 1 tg ) ( i r a n i ma 4 o 1 e i ng ) ( r e s i d e r. c e ) 
(color adject i vp ) 



4.1212 Lis' Foimat . 

See 1.1212. 
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4.122 Output Files. 

The output consists of a disk file (channel 5) of the s*me format 
as the input set list. It differs from the latter in that it l) lists 
the vocabulary items found on the channel- 1 input file under each of the 
set labels vith which they are associated, and 2) includes set labels 
found in the vocabulary list but. not in the channel-2 input file. 

4.1221 Example of Output with Corresponding Input. 

4.12211 Channel-1 Input. 

< boy 2" (animate being) (juvenile) (male) (person) 

4.12212 Channel-2 Input.. 

(animate being) 

< dog > 

< man > 

< woman > 

( ma 1 e ) 

< man 2 

(person) 

< man > 

< woman > 

4.12213 Output-. 

(animate being) 

< boy ^ 

< dog > 

< man > 

< woman > 

(juvenile) 

< boy > 

(male ) 

< boy > 

< man > 

{ person) 

< boy > 

< man > 

< woman > 

4.3 Evaluation and Prospects for Future Development . 

While the program documented under 4.2 is a useful labor saver, it 
would appear that considerably greater economics could be attained by a 
program which, by taking account of the fact that membership in one ret 
often implies membership in one or more additional sets, would permit a 
significant reduction in the number of set labels associated with 
vocabulary items on the input list. Thus, it would appear that the item 
< boy used in 4,12211, could be coded simply as ( Juvenile) (mal e) , since 
any item associated with the first of these two set labels would also be 
associated vith (animate being) and (person). 
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5. The Element ary Verbal Conanunicator (EVC). A New String-Manipulation 

Language for the PDP-10 Computer* 

5.0 Justification and Purpose. 

5.01 The Need for a Higher -Level Language. 

Although the programs documented in sections 1, 3, and b were written 
in PDP-10 assembly language, it soon became clear that rapid progress in 
the development of programs capable of generating instructional material 
would necessitate the use of a higher-level language. This was due to 
l) the large number of programs to be written, and 2) the interdependent 
character of many of the programs. Thus, the three programs documented so 
far center on a single problem- -the generation of large numbers of 
utterance patterns from a small amount of input. No attention has been 
paid to such complex problems . s the correlation of utterance patterns in 
the base language with those of the target language or the generation of 
inflected forms from the basic forms given in set lists. 

The interdependency of many programs can be seen from the fact that, 
of the three programs discussed so far, the output of those documented 
under 3. 1 and *4,1 serves as input for the one documented under 1.1. This 
means that a modification in one program may easily entail corresponding 
modifications in another Such a chain- like relationship between different 
programs makes "be greater programming speed afforded by a higher-level 
language part icuiariy desirable- 

5.02 Reasons for Developing a New Language • 

The higher 'level languages available oa the PDF -10 system used by 
the principal investigator included Fortran, Gogol, Sail and Lisp Of 
these, 4 he first was no* at all suited for string manipulation of 4 he type 
entailed by our programs. Gogol and Sail were also less than satisfactory 
for our purposes, and, in add i Mon, were r*o f regularly used by tr.e systems 
programmer connected with our project. The fourth language, Lisp, while 
more suf.ed f c our needs, was so inadequately documented that its use might 
have entailed a large number of unforeseen difficulties. 1* was hoped 
4 hat the IXSSS would be able to make available another string-manipulation 
language- *SN ' BvL III. However, 4 his language, which was ideally suited 
for our purposes, did not become available during the contract period. 

lr, view of these difficulties it was decided to develop a string- 
man i pula 4 ion language specifically adapted to tr.e needs of our own research. 
This ape roach proved to have two major advantages. First, i* made the 
principal investigator to a large extent independent of other programmers, 
since the writing of the programs documented above had given him considerable 
familiarity vi*h the FDP-10 assembly language* -the basic tool in the 
crea f icn of 4 he new spring manipulation language. Secondly, it allowed 
him to try a large number cf approaches to one and the same programing 
p rob lem- - a free dorr* a ivay s c ur t a i le d * o $ ome ext e nt by t he u* ills a* icn of 
a pre-exis‘ing language. 




5.1 



Documentation. 



Any program written in the string-manipulation language E VC consists 
of a series of instructions. Each instruction must begin with an operation 
code. An operation code is defined as a string of alphanumeric characters 
immediately preceded by a line feed and immediately followed by the 
tabulation character (henceforth referred to as the tab mark). Operation 
codes may not begin with a hyphen (cf. 5-511) . Most, but not all, instructions 
include one or more additional constituents, henceforth called operands. 
Operands are discussed in the sections dealing with the operation codes 
they accompany. 

5.11 I nput . 

5.111 Disk-file Input. 

’The current version of EVO permits the concurrent usage of two disk 
files for input purposes, (it would, of course, be relatively simple to 
increase this number, but such a modification would entail no fundamental 
changes in the language- -only the creation of additional input instructions.) 
Since the input files currently allowed must be attached to disk channels 
2 and 4, ve shall henceforth refer to them as file 2 and file 4, 
respectively. 

5.1111 Input from File 2. 

5.11111 General Characteristics. 

Input from file 2 is transferred to a core storage area which will 
henceforth be referred to as the item buffer. (The size of the item 
buffer currently permitted is 50 PDP-10 words, i.e., 250 seven-bit ASCII 
characters. It would, however, be a simple matter to develop one or more 
other versions of EVO with item buffers of different lengths.) The 
storing of new input always begins at the first word of the item buffer 
so that newly stored input must be transferred to another storage area 
(see 5 .III 32 below) if it is to survive the execution of subsequent 
input inst rue 4 ions . 

5.11112 Input Operation Codes. 

All instructions resulting in the transferral of input from file 2 
to the item buffer operate on a character- by- character basis, i.e., only 
one seven-bit ASCII character at a time is read into the PDF- 10 
accumulator utilized for such input. The operation codes, which all 
begin with the sequence TOT, fal1 into vV0 6 r0U P*“ -those which commence 
the storage of input in the item buffer with the next available ASCII 
character on file 2, ani those which store no characters until a given 
character or character sequence has teen encountered. The latter type will 
henceforth be referred to as ini iat or-depenient operation codes. 



^.111121 Ope rat ion Coles vi*h Immediate Commencement of Character 
Storage - 

The storage of characters from file 2 in the item buffer continues 
until the program encounters a terminator. In addition to the end-of-file 
mark, the terminator may be a single character, a character sequence, or 
any one of as many as five different characters. Where only a single 
character or character sequence is utilized to terminate the transmission, 
it is possible either to Mop the storage of characters with the last 
character preceding the terminator to store the terminator as well. 

5.1 Li 12 11 The Operation Cede TXTUPll) . 

To is operation code causes the storage in the input buffers of the 
next available character on file 2 and of all characters following it up 
to (but not. including) a given character or character sequence. The 
character or sequence which i: used as a terminator follows the tab mark 
arid is enclosed by a pair of identical delimiting symbols which may take 
the shape of any character not included in the terminator itself. 

Thus, the instruction TXrUPfO x-x will result in the storage in 
the item buffer of the nex 1 available character on file 2 and of all the 
following characters up to the first period. The same result can be 
achieved by TXTUPrO 1.1 , TXTUFI'O - . - , etc. The instruction 
r l l X?Ui TO - and - would resui 4 in the storage of all characters preceding 
the first occurrence of t h r - word arid, while TXTUPPO zbutz would halt 
storage wit h the last character preceding th e first occurrence of the 
string hut. 

5*11112111 Examples of Opera* ion. 

5.111121111 Instruction: TXTUP70 y, y- 

Next available file 2 input. Tom, Dick and Harry. . . 
characters stored in item buffer; Tom 

5 111121112 Tr .true* ion: TX flJPTO , and , 

(life inpu- as in 5-111121111) 

Characters stored: Tom, Lick 

5 . 11 1 i 2 1 2 The 0 pe r a A i c n Cod • • fh'T Hr U . 

This differs from WTUPTO (cf. 5 .1111211) only in that it results in 
the storing of th rt *ermina*or as veil a> the characters preceding it . 

Thus, if ve use IXnUdj in 5.111121111 * he characters stored would to Tom, . 
It s use in 5 . lit 12 111 2 would result in the storage of Tom, Dick and . 

5 .IIII 215 The Operation Code TXT ALT . 

This opera* ion code is identical in its effect with TXTHHU , except T hat 
the terminator my any one of a set of five or fever single characters. 
The '-haraMcrs to be used as terminators immediately follow the tab mark. 
The final charac'er of 1 he se’ is immediately follovri by a period, while 




any others are immediately followed by a comma, which is Itself immediately 
followed by the next terminator. The instruction TXTALT 1 , ? . will 
terminate the transmission of input with the storage of tie first 
exclamation point or question mark encountered. Note that both the comma 
and the period will be treated as terminators, if they occur in an odd- 
numbered position. Thus, the instruction TXTALT . , j , i , ; , ? . would result 
in the termination of input transmission after the storage of the first 
period, comma, exclamation point, semicolon, or question mark encountered 
in the text . 

5.11112131 Examples of Operation. 

5 .IIII 213 H Instruction: TXTALT 

5 .1111 213 111 Next available input: Tom, Dick and Harry. 

Characters stored: Tom, 

5 .1111 2 1 3 11 2 Next available input: Dick and Harry. 

Characters stored: Dick and Harry . 

5 .1111 213113 Next available input: Is he home? No, he's net'. 

Characters stored: Is he home? 

5.111122 Initiator- Dependent Operation Codes. 

As was indicated above, these operation codes do not result in the 
storage of input characters in the item buffer until the program encounters 
an initiator, i.e., a character or string of characters specified in the 
instruction. Storage may commence with the initiator itself or with the 
first character following the initiator. Termination follows one of the 
patterns discussed in 5*1111211 and 5*1111212. Note that in these 
instructions both initiators and terminators are limited to a single 
character or character sequence. It, would* however, be relatively simple 
to develop additional instructions permitting the use of alternative 
initiators and/or terminators (cf. 5*1111213)* 

5.1111221 The Operation Code TXT INC . 

This operation code commences storage of characters in the input 
buffer with the first character of the initiator itself and terminates 
storage with the final character of the terminator. The initiator follows 
the tab mark and is set off in the manner specified for the terminator 
in 5 .IHI 2 II. The closing delimiter of the initiator must be immediately 
followed by the terminator, also set off as in 5.1111211* Thus, for 
example, the instruction TX7INC - and- -but - will result in the storage 

of the first example of the s' ring and encountered on file 2 plus all the 
following characters through the first occurrence of the string but . 



5.11112211 Examples of Operation. 



5 . 111122111 Ins* rue '.ion. I’XTINC /p//y/ 

Next available input: Tom, Dick and Harry* 

Characters stored Dick and Harry 

5-111122112 Instruction: i'XTINd - H a - x . x 

Input as in 5 - 111122111 . 

Char act ers ft ored : Harry. 

5.1111222 The Operation Code TXTEXC . 

This is identical in format and operation with TXTINC , except that 
neither the initiator nor the terminator is stored. Thus, its use in 
5.111122111 vould result in the storage of ick and Harr * In 5*111122112 
it vould s’ ore ar ry only- 

5 111122.5 The Opera’ ion Code TXTEA 1 . 

This opera 1 icn code is identical in format and operation with TXTINC 
and TXTfiXO j except that it stores the terminator and omits the initiator. 
Thus, its use in 5*111122111 vould give ick and Harry . Jn 5*111122112 
it vould store arry . only. 

5 .III 2 Input from Kile! T , 

This follow: exactly the same pa* tern as that described under 
5 . till, even to 1 he use of one and * he same item buffer. Instruction 
format is identical except tha’ operation codes which operate on file 4 
must begin with 1 he sequence TuT ins’ r ad of TXT, e.g., TVrUPTO, TuTALT , 
T^TIN -d, r1 . c . 

5-111} Tne Input of Literals. 

5 . 111}1 Transfer of Li 4 orals to item Puffer. 

It if possible to place a string of one or more ASCII characters in 
the i 1 e m c j f f e r s imply by us i ng T he o pe ration cod 0 IX FLIT . The 
Chirac* ei: : to be stored m the it r m are se* off in tr.e same manner as the 
teimlnator urd with : X'.e f P?0 (of. 5-1UI21L). Thus, 1KTLIT - loo- places 
the three characters boo in the item buffer- The sair.« result is, of course, 
achieved by I XT LIT xtoox or IX?LIT /boo /, etc. Vo place the phrase 

Tom, Pick and Harry i n 1 r** i t em buffer vr write on 1 y I XTLI P x Torn , Pic k 

and H a rryx - 

5.111}2 Input Followed by Storage of Li* r rals and/or Sets of 

Literals in Core Area Other ’ har. Jrpu’ Puffer* 

Sir.ce i r the vast majority of cases it is desirable to be able to 
retain s’ored liberals for future usage, the input of a literal will 
almos* always be followed by i* 5 transfer from the i’em buffer to another 
st 01 age area. In addition, it is of* on convenient to be able to retrieve 
ncr^ than one li*eral a* a time. Tr.e following instructions result in 
l) the storage of a ii’eral or a set of literals in an area of co re other 
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than the item buffer, and 2) the association of the literal or set of 
literals stored with another literal, also stored in an area o r core 
other than the input buffer. The former will henceforth be referred to 
as a class , while the latter will be called a class name . If a class 
includes more than one literal, each individual literal will be called 
a class member . 

5.111^21 The Operation Code NWCLS . 

This operation code, which can be used only when none of the literals 
to be processed includes either a comma, a period, or a percentage sign, 
has the following format: the tab mark is immediately followed by the 
literal which is to become the class name. The latter is immediately 
followed by a comma which is itself followed either by a space or by the 
first character of the first class member. The last class member is 
immediately followed by a period All the others are immediately followed 
by commas, which may be followed by a space preceding the first character 
of the next class member. Thus, the instruct ion fflCLS vowels a,e,i,o,u. 
results in the storage of the literal vowels as a class name associated with 
a class whose members are the five vowel letters preceding the period. 

The instruction NVCLS ?,?, results in the storage of the literal _? 
both as a class name and as the sole member of the associated class. 



3-111522 The Operation Code NC UNTIL 

This may be used in place of NWCL S when it is necessary to store 
class names or class members including either a comma or a period or both 
(no class name or class member may ever include the percentage sign). 

The first character af"er the tab mark is the delimiter chosen to set off 
the class name from the first class member and t he individual class members 
from one another. It must be immediately followed by the first character 

of the class name and it must Immediately follow both the last character 

cf the class name and that of each of the class members. There may, 

however, be a space between the delimiter and the first character of a 

class member. The delimiter immediately following the last character of 
the last class member is itself immodia* ely followed by another delimiter. 
None of 1 he literals to be stored may include the delimiter. 



Thu s , t he ins t rue t ion NCTfl *A /vowel s/a/e/ i /o/u// has t he same 
results as MV C IS vowel, a, e, 1, o , u. ( cf . 3, 111521). However, the latter 

cannot be utilised in place of NCIflIM /averagos/l .2/5>^ / 6.2// , where the 
class member s 1- 2 , 5 ■ 1 6 ■ 2 i nc lude t he pc r iod . 

3.12 Manipulation of Strings vi’hin Core Storage Areas. 

3*121 String Stored in Item Buffer. 



3. 121i 



Deletion of String Characters. 

Cp^ra* ion Cedes Specifying Chat act ers to be Dele'ed. 
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5.121111 The Operation Code KILLST . 

This operation code results in the deletion of the final charac ;er 
and as many as 28 of the immediately preceding characters from the siring 
currently stored in the item buffer. The tab mark is followed by a decimal 
number from 1 through 29. Thus, if the item buffer contains the string 
Harry , the instruction KILLST 1 would reduce the contents to Harr, 
while the instruction KILLST 2 would reduce Harry to Ktr. 

5*121112 The Operation Code KILFRS . 

This is identical with KILLST except that deletion begins with the 
first character and proceeds with the immediately following ones. Thus, 
KILFRS 1 changes Harry to arry , etc. 

5.12112 Operation Codes Specifying Characters to be Retained. 

5*121121 Numerical Specification. 

5.1211211 The Operation Code LKTLST . 

This code causes the deletion of all but the final character and as 
many as 28 of the characters immediately preceding it. Format follows 
that for 5.121111. Thus, UCTLST 5 changes Harry to rry . 

5.1211212 The Operation Code K KTFRS . 

This is the reverse of IKTLST , deleting all but the first character 
plus as many as 28 of the immediately following characters. Thus, 

IiKTFRS 3 changes Harry to Har . 

5*121122 Delimiter- Dependent Specification. 

5.1211221 Specification of Initial Delimiter. 

5.12112211 The Operation Code I KBGNG . 

This code deletes all characters in the input buffer preceding the 
first occurrence of a given character or character sequence. The latter 
is set off as In 5 • 11112 1 1 . Thus, LKBGNG -r- changes Harry to rry . 

5.12112212 The Operation code IXFLLW . 

This differs from IKPGNG only in that the delimiting character or 
sequence is also deleted. Thus, IKFLLW JrJ changes Harry to ry. 

5.1211222 Specification of Torainal Delimiter. 

5.12112221 The Operation Cole LKTHRU . 

This causes deletion of all characters following the first occurrence 
of a given character or character sequence. Format as in 5 *1111211 • 

Thus, IX THR ? J xrx changes lian y to Har . 







5. 12112^22 The Operation Code LKUPTO . 

This differs from LKTHRU only in that the dc?limiting character or 
sequence is also deleted. Thus, IJCUPTQ , r, changes Harry tc Ha. 

5.12115 ^Deletion Dependent on Immediately Preceding Instruction. 

5.121151 The Operation Code LKTBST . 

This operation code, which has no operand, must be used immediately 
after an instruction containing one of the following operation codes: 
NM0GNG, NKFIiLW , NMFRST , NMLAST , NMTHRU , NMUPTO (cf. 5*12122 below). Its 
effect is to delete from the item buffer those characters stored by the 
immediately preceding instruction. Thus, the two instructions NMFRST 1 , 

start (cf. 5.1212212) and IXTRST have the same effect on the item buffer 
as the single instruction KILFRS 1 , i.e. they change Harry to arry . 

The sole difference is that in the former case the H has been stored as 
the sole member of the class start , while in the latter it has been 
irretrievably lost . 

5.1212 Transfer of Characters in Item Buffer to Other Core Areas. 

Transfer of some or all of the characters currently in the item 
buffer to another core area follows the pattern for the storage of literals 
described under 5.11152* The characters to be stored are treated as a 
single literal, i.e. they become the sole member of a class. Since 
storage and retrieval entail the association of the characters with a 
class name, each of the operation codes described below contains either 
the sequence NM or the sequence NAM . 

5.12121 Storage of Entire Contents of Item Buffer. 

5-121211 The Operation Code NMITHj . 

This operation code stores the entire current contents of the item 
buffer as the sole member of a class. The class name follows the tab 
mark and, in contrast to 5.121212, is not subject to modification by the 
program. Thus, NM1IEM label stores the entire current contents of the 
item buffer as the sole member of a class with the name label . 

5.121212 The Operation Code SERNAM . 

This code differs from NM 1 TEM only in that the class name following 
the tab is expanded by one of th^ decimal digits from 0 through 9- The 
digit chosen is determined by the number of times the class name occurring 
after the ope rat ion- code tab mark has already teen used. Thus, the first 
occurrence of SgKHAM label will associate the current contents of the 
item buffer with the class name labcl0 . The next occurrence of this 
instruction associates the item-buffer string with label 1 , the next with 
late!2 , etc. 

5-12122 Transfer of Part of Item-Buffer Contents. 

5-121221 Numerical Specification of Characters to be Stored. 
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5- 12122.. 1 The Operation Code NMLAST . 



This operation code transfers the final character in the . tem buffer 
plus as many as 28 immediately preceding characters to a core area other 
than the item buffer and associates the charac+ *s, which are treated as 
a single class member, with a new class name. The contents of the item 
buffer are not affected, but pointers are set to indicate which characters 
in the buffer were not transferred (cf. 5*121131 and 5 * 1212231 ) . 

The tab mark is immediately followed by a decimal number from 1 
through 29, which is itself immediately followed by a comma. The latter 
is followed by the new class name, with or without an intervening space. 
Thus, the instruction NMLAST 2, end does not change the contents of the 
item buffer, but, if the latter contains the string Harry , stores the 
sequence _ry as the sole member of the class end . In addition it sets 
pointers indicating that the characters Har were not stored by this 
instruction. 

5.1212212 The Operation Code NMFRST . 

This operation code differs from NMLAST only in that it transfers 
the first character and as many as 28 immediately following characters. 
Thus, NMFRST 2, begin leaves Harry unchanged in the item buffer, stores 
Ha as the sole member of the class begin , and sets pointers to indicate 
that the characters rry were not transferred by this instruction. 

5.121222 Delimiter- Dependent Specif icat ion of Characters to be Stored. 

5.1212221 The Operation Code NMFLLW . 

This operation code transfers all characters in the item buffer which 
follow the first occurrence of a given character or character sequence and 
associates them with a new class name as the sole member of the class in 
question. The contents of the item buffer are unchanged but pointers are 
set to indicate the characters not stored as a result of this instruction 
(cf. 5*121131 and 5*121223l). The tab mark is immediately followed by 
the delimiting character or character sequence, set off as in 5 .1111211. 

The latter is followed by the new class name, with or without an 
intervening space. If the item buffer contains Harry , the instruction 
NMFLLW -ar-end stores the sequence as the cole member of the class 
end , leaving the item buffer unchanged, and indicates that Har has not been 
moved . 

5-1212222 The Operation Code NMBGNG. 

This parallels NMFLLW , except that the delimiting character or 
sequence is also stored. Thus, its use in the example for 5.1212221 would 
result in the storage not of r^, but of arry . 

5.1212223 The Operation Code NMUMTO . 

This code parallels KMF LLW except that it stores the characters 
preceding the first occurrence of the delimiting character or sequence. 

Its use in the example for 5*3212221 vould result in the storage of H. 



3.121222^4 The Operation Code NMTKRU. 

This parallels NMUPTO , except that the delimiting character or 
sequence is also stored. Its use in the example for 3* 121222 results 
in the storage of Har . 

3.121223 Transfer Dependent on Immediately Precedirg Instruction. 

3.1212231 The Operation Code NMREST , 

This code, which can be used only immediately following an 
instruction with one of the operation codes NMBGNG , NMjLLW , NMFRST , 
NMLAST, NMTHRU or NMUPTO, transfers all the characters in the item buffer 
not moved as a result of the preceding instruction and associates them 
with a new class name as the sole member of the class in question. The 
contents of the item-buffer are unchanged. The tab mark is immediat ely 
followed by the new class name. Thus, the instruction sequence 
NMFRST 1, start and NMREST end leaves an item buffer containing Harry 
unchanged, but stores the character H as the sole member of tne class 
start and the string arry as the sole member of the class end . 

3.122 Manipulation of Strings Stored in Core Areas Other than 

Item Buffer. 

3.1221 Inclusion of Two cr More Classes in Another Glass. 

It, may often be useful to be able to manipulate the members of two 
or more classes at a single stroke. This can be accomplished by 
including the membership of each of the classes in question among the 
members of a single larger class. 

3 12211 The Operation Code uOMFOS . 

This operation code establishes a new class the members of which 
include all the members of two or more other classes. A new class 
resulting from the u e of this operation code will be called a comp osed 
class, in contrast to a non- com posed class, i.e., a class resulting from 
the use of any other operation code. 

The format of instructions with this operation code exactly parallel 
that of those used with the operation code NVCL3 (cf. 3*111321). Tne nam 
of the new composed class stands between the tab mark and the first comma 
followed by the classes vho^e members will be included in the membership 
of the new composed class. II new composed class may include not only 
the members of non- composed classes but also those of o- her composed 
classes. Note t ha 1 none of the class names cr class members may include 
a period, comma, or percent sign (cf. 3-111321) 

If vp assume ♦he existence of the nor,- composed classes st art and end 
generated as in 3.1212231, ve can gain the abilit y to ranipula 4 e the erC i 

name Har ry by using the instruction COMPOS na re,s 4 art ,er. \ . The r/r.te 

of the new ncr.-ccmpoi.-ci class name would be H ani arry (Note that the 
sequencing cf tne members cf a composed class leflects the order in which 
its component classes are named ir* the T MFC 3 ir»s f ruct icn. ) 
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As a more meaningful example ve may cite the instruct icnal sequence 



NWCLS 


vowels, a,e,i,o,u. 


NWCLS 


liquids, r, 1. 


NWCLS 


nasals, n, m. 


COMPOS 


vocalics, vowels, liquids. 


COMPOS 


sonorants, vocalics, nasals 



The composed class vocaLics includes the members a, e, _i, o, u, r, 
and 1. The composed class sonorants includes all of these plus n and m. 

5.1222 Deletion of Classes. 

The following instructions permit the deletion of strings stored in 
co*'e areas other than the item buffer. Since such strings are always 
associated i'ith a non-composed class, their deletion also entails the 
deletion of the corresponding class name. Note that there is as yet no 
provision in ETC either for the deletion of composed classes or for their 
modification to reflect the deletion of non- composed classes which they 
include. It follows that the deletion Inst ruct Ions must be used with 
extreme care. On the other hand, the deletion feature permits the 
repeated use of one and the same class name with different memberships, 
i.e., it makes possible the storage of both variables and constants in the 
same manner. 

5.12221 The Operation Code DELETE. 

This operation code results in the deletion from core of an entire 
non- composed class- -both the class members and the class name. The tab 
mark is immediately followed by the name of the class to be deleted. 

Thus, assuming that the instructions cited in 5*1212231 had been executed, 
the instruction DECEIT] end would result in the removal from core of 
both the class name end and the class member arry . 

5.12222 The Operation Code LKLSEP . 

Tliis code parallels DELETE except that it deletes the names and 
memberships of all the non- composed classes constituting a series formed 
in accordance with 5*121212 above. 'Thus, if, after executing two of the 
SEP NAM instructions discussed in 5*121212, ve v r i t e PELSEB label , both 
the class names label and label 1 and their members would be deleted. 

5.1223 Transference of Strings to Item Differ. 

5*12231 Transfer of Individual Class Members to the Iter* Buffer. 

These instructions make possible the examination and manipulation of 
individual class members. They are applicable only to non-composed 
classes. 
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5-122511 The Operation Code LOOKAT . 



This operation code transfers the first member of a given non-composed 
class to the item buffer. In addition, it sets a class-membtr pointer to 
the first character of the second member of the class, if any such member 
exists. The class name immediately follows the tab mark. Thus, if we 
assume the previous execution of the instruction NWCLS nouns, hat , dog, 
face, (cf. 5.111521), the instruction LOOKAT nouns places the 
characters hat in the item buffer. 

5.122512 The Operation Code LKCLNX . 

This code transfers to the item buffer that member of a given class 
which is indicated by the pointer mentioned in 5 .122511* In addition it 
resets the class-member pointer to the first character of the next member 
of the class, if any such member exists. The format parallels that for 
5.122511. 

Thus, if we follow the LOOKAT instruction illustrated in 5*122511 
with LKCLNX nouns , the string dog will be transferred to the item 

buffer. If we repeat the LXCINX instruction the item buffer will contain 
face . 

5.122515 The Operation Code LKCRCM. 

This operat ion code also transfers to the item buffer a single member 
of the class whose name follows the tab mark. However, the member chosen 
is the member whose sequential position within the class corresponds to 
that of the last class member of any non-composed class transferred to 
the item buffer by either an LOOKAT or an LKCLNX instruction. Thus, if 
the class verbs has the membership run , go . Jump , and if after manipulating 
the string face , placed in the item buffer by the second LKCINX instruction 
mentioned in 5.122512, ve write IKCRCM verbs , the member transferred 
to the item buffer will be Jump, since it occupies the same sequential 
position within the class verbs , which face occupies within the class nouns . 

5.12251^ The Operation Code IKSRnG . 

This operation cole places in the item buffer the role member of the 
first of a series of one •member cl asses generated by the use of the 
instruction SEPNAM (cf. 5*121212 above). In addition, a series-member 
pointer is set to indicate 1 he class, if any, which is the second member 
of the series in question. The format is the sane as for the operation 
code SERNAM (cf. 5.121212). Thus, the instruction IKSRDG labe l will 
put the class member associated with the class name late 10 In the 
item buffer. 

5.122515 The Operat ion Code 1KSKNX . 

This opei at Ion cole places in the Item buffer the class member 
associated with that class of a series generated by the use of SERNAM 
instructions (cf, 5*121212) which is indicated by the series- member 
pointer mentioned in 5*12251^. It also resets the pointer to indicate the 
next class in the series, if any such exists. Thus, if, after execution 
of the IKSRBJ instruction ritel in the program encounters 



LKSRNX labe l, it will transfer to the item buffer the class member 
associated with the class name labe 1 1 and reset the series-member pointer 
to indicate the class name label? , if such a class exists. 

5 . 1 ? 23 l 6 The Operation Code IKCRSM > 

This code parallels IKCRCM (cf. 5*122313)* except that it operates 
with classes included in series generated by SERNAM instructions 
(cf. 5 . 12121?) . Thus, if after * he execution of the IX SRNX instruction 
cited in 5.12231^ the program encounters the instruction LKCK5M word , 
it, will place in T he item buffer the class member associated with the 
class name word! and set * he series-member pointer to indicate the class 
name word? . 

5-12252 Transfer of All Members of a Class to the Item Puffer. 

5.122521 The Operation Code DUMP. 

This operation code moves the entire membership of a non- composed 
class, with percent age marks separating the individual members, to the 
item buffer. The class name follows the tab mark. Thus, assuming 1 he cam' 
class membership as that given in 5- 122511* the instruction PI MP nouns 
places in *he i*em buffer the sequence hat V : dogs face - 

5.12235 Transfer to the Item Buffer of the Membership of All 
Classes in a Series. 

5.122551 The- 0 p e r a* i 0 n C od e JO IN SR . 

This, operation code places in the item buffer, without intervening 
spaces, * he membership of all the classes in a series generated by 
repeated SER-iAM instructions (cf. 5*121212). The sequencing of class 
r ■ * r.t e r s cor re s p c r. i : to t : :e a s e e n i i ng n u m e r i c a 1 c r d e r of t h e d i g its i n 
t he class n ame s associ at c d v i 1 h *. hem . Th.u s , ii t he class Labels h a s 
1 he r/.mter xv ar, i the class la bel i }.as the member a, and there 5s no 

class laV J2 , * he ins*, rue* ion J id. ’232 label places xyu in t );e item 

buffer. 

5 .122 5 32 T h e 0 j e v a 1 5 . 0 n Co de SR^fKR . 

This differs from J " Hi P R only in f h:T individual class members are 
separated by a space. Its use in the ex cample cited for 5*122331 places 
>; y a in the it e m b u : 2 e r . 

5 . 1 3 0 : : 1 1 r o 1 of Pr og r am K lev , 

5 . 1 5 1 in^cr.h- ional Tr a n s f e 1 . 

5 . 1511 Transfer to c;. Instruction Preceded by a Label. 

A label in an KVC program is a string of characters immediately 
freckled and followed by single hyphens. The initial hyphen must be 
i r me d i at e 1 y r re : e de 1 by a line fee:. V.*h i le t he * c rm i n a 4 i ng hyp he n ne e d 
not be irm.edi V. « lv followed by a line fee:, the program, will ignore all 



the characters which occur between that hyphen and the next line feed. 
(For examples of labels see 5*13111 and 5-13112.) 

5.13111 The Operation Code GOTO . 

This operation code causes the program to take as its next 
instruction the first instruction following a given label. The label, 
minus the preceding and following hyphens, is placed immediately after 
the tab mark. Thus, GOTO middle causes the program to take as its 
next instruction the instruction following the label -middle- . 

5.13112 The Operation Code GOTOIT. 

This operation code differs from GOTO in that the label is not 
specified in the instruction but is assumed to be already present in the 
item buffer. There are no other operands. Thus, if the item buffer 
contains the sequence middle the instruction GOTOIT will have the same 
effect as GOTO middle . 

5.13113 The Operation Code VIS1T - 

This operation code differs from GOTO only in that it sets a 
return pointer to indicate the location of the next instruction following 
the VISIT instruction. The return pointer may then be utilized in 
connection with a GOHOME instruction (cf- 5 - 13121 ). Thus, the use of 
VISIT middle has the same immediate effect on program flow as the use 
of GOT O middle . 

5.1513^ The Operation Code VISITI. 

This differ? from VISIT (cf. 5-13113) Just qs GOTOIT (cf. 5 . 13112) 
differs from GOTO (cf. 5»l?lll). Thus, if the item buffer already 
contains the string middle , VISITI will have the same effect as 
VISIT middle. 



5.1312 Transfer to an Instruction Preceded by a VISIT 
or VISITI Instruction. 

5.13121 The Operation Code GO HOME - 

This operation cole causes the program to take as its next 
instruction the instruction indicated by the return pointer mentioned 
in 5 .I 3 II 3 . I* has no operands. Note that the use of GOHOME must- be 
carefully coordinated with that of VISIT and VISITI . 

5.1313 Omission of Instructions. 

5.13131 The Operation Code SKIP- 

This operation role causes the program to ignore a number (from 1 
to decimal 2 ^) of lines ( instruct ions and/or labels) in selecting the next 
instruction to be executed. The number of lines to be ignored is 

specified by a decimal number following the tab mark. Thus, SK I P 3 

causes the program to take as its next instruction the instruction on the 
fourth line following the SKIP instruction, (if that line is occupied 
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by a label the program will take its next instruction from the following 
line • ) 

c ) i T? Go nd it iona 1 Trane f e r . 

Since all condi ■ ional-t ransfer instructions have operation codes 
beginning with the sequence IF , they will henceforth be referred 10 as 
IF instruct ion&. All IF instructions cause the program to test for tht 
exist c nc e of a giver* c ore i t ion . Depe nd i ng on t he resu It s of t he t e s t , 
the program either proceeds to the next instruction following the IF 
i r i 1 1 uc* ion or omit.* * ha 5 instruction and transfer’, to the following 
ins* rue* ion, i ♦ *.* • , behaves a ? if 1 he JF instruction were the instruction 
skip i (cf. ;.iyt>i), 

^.lf-21 IhansiYj Conditional on 4 he ’d «sult s of a Comparison 
of th" Content? of Item Buff»?r and a Literal. 

i > . i ;)2 1 1 ft e operation Code IV f LFN . 

This e; ‘-ra’iot code causes t he program to omit ; he next instruction 
if t ,\e conh-r/'S of •>,»* i* em b if for are no > identical with a given 11’ era 1. 
The 1 i 1 "i a 1 , -et off a s in f ? • 1 1 lie? 11 , f ol lows t he t ab ma rk . Thu s , it' 
the item cut f“r contains th^ string Harry , t he ins- ruct ion TF IDKN -John- 
wili resul* in the omission of the next instruction. If 1 he item buffer 
doe? con 1 air 4 he s* ring John , * he next instruction will be executed. 

Yne in * rue 1 ion afer an II I PEN in t rue 4 ion is usually an 

unc ondi* itr.a'L * ransfeJ instruction (cf. Thus, 4 he- instruction 

TF1DKN x Jcr. tx rrig.hh be inmediat »?iy followed by GOTO middle sc that. 

the presence of the string John in the item buffer would result in a 
transfer ? he ins’ rue 4 ion following the label -middle- (cf. U.l^Jll). 

S . 1 L? 1 2 7h»> ip^'jsv ice Jod-- T VIJf h 

: Tnis causes s^i-sion ol 4 he following instruction if * he cc/f*nts cf 
the i • em tuff-T do nc* a given literal. Format as i n 3 -Is- 11. 

..thus, if 1 h fi i'em buffer con 1 a in - * Is a’ ring Johnny, t he ins - rue t ion 
II TNG /.Tors */ would not r> ~ul * in an omifbed ins 4 rue* ion, ait heugr, 
ip in:-r . T / v c->.Ti / would . 

S 1.521.51 Ti.v* Op.-ra* icn -Jcj- IHCjJD. 

IT.i ' cau^-t omision of a line if sequence of characters in * he 
i*'m l/l-r including the final eh:rac’ er ar* not iden- ieal wi*h a giv'.n 

liberal. I'onr.a* a in ■ 1 j? 1 1 > T vus, Ti-KHD - y- will no*. result in 

an omi*v:d in ‘ruction vheM.-r * h ^ i’en buffer contains Mar i y or John ny. 

H c-ve ve i , II£NT j - ry- will om i * an i n s * ru c * i oil if * he 1 a t 4 r s t ? i g i s 

enco unt ?. r-. d • 

Tr e Op * 1 r i * ion Go I e TI TtY . 

This diffii ? from only in t ha* he raring in * iter, bufft r 

mu * ir.cluie * h^ firs* characer. Tliu s . IHgG /John/ *• ill not emit 
an ire* rue* ion i: « l*her Jc-M or Johnny is in th* 1 i* cm buffer, In* 

IlbR t -Jehrn- will or;* an in:-* ruction if it * no our/ c rs the former. 



5.1322 Transfer Conditional on Number of Characters in Item Buffer. 

5.13221 The Operation Code 1FLNGRTH . 

This causes omission of an instruction if the item buffer does not 
contain more than a given number of characters. The number (in decimal 
notation) is given after the tab. Thus, IFLNGRTH 5 will omit an 
instruction if the item buffer contains John or Harry , but not if it 
contains Johnny . 

5.13222 The Operation Code IFSHTRTH . 

This omits an instruction if the item buffer does not contain fever 
than a given number of characters. Format as in 5 .13221. Thus, 

IFSHTRTH 5 omits a line if the item buffer contains Harry or Johnny , 
but not if it contains Jonn. 

5.13223 The Operation Code IFEQIjGTH . 

This omits a line if the item buffer does not contain a given number 
of characters. Format as in 5*13221. IFEQLGTH 5 omits an instruction 

if either Joh n or Johnny is encountered, but not if the item buffer contains 
Harry . 

5.133 Transfer Conditional on the Results of a Comparison of the 

Contents of the Item Buffer with One or More Class Members. 

5.1331 The Operation Code IFMEM . 

This operation code omits an instruction if the contents of the 
item buffer are not identical with one of the members of a given class 
(composed or non- composed) . The class name follows the tab mark. Thus, 
assuming the same class membership as in 5*122311? the instruction 
IFMEM nouns will omit a line if the item buffer does not contain one 
(anl only one) of the three strings, dog , face , hat . 

5.1332 The Operation Code IFMB5R . 

This differs f ron IFME* only in that the contents of the item buffer 
av compare i with the membership of each of the classes constituting a 
series generated by the use of BERN AM instructions (cf. 5*121212). Thus, 
IFMRSR label , assuming the same classes and membership a.: in 5*122331? 
will omit an ins* ruction unless the item buffer contains either a or xy . 

5. 1333 The Operation Code IFKKEXTH . 

This operation code omits an instruction if the item, buffer does not 
include a larger number of characters than the number found in the first 
member of a given non- composed class. Format as in 5*1331* Thus, 

ITKKEX TH nou. n s , assuming the same membership as in 5*122311? will omit 

an instruction if the item buffer contains less than four characters since 

ha^ , the firs t re mb e r cf the class in que s t i or., coni a i ns t hre e . 
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The Operation Code IFLSKXTH , 



5.1334 

This differs from IjjjggCTO only in that an instruction is omitted if 
the item bu^ wi * does not contain fever characters than the first member of 
the non-composed class. Thus, assuming the same membership as in 5.122311, 
IFLSEXTH nouns will omit an instruction if the item buffer contains more 
than two’ characters, 

5.1335 The Operation Code IFEQKX . 

This differs from ITMKEXTH only in that an instruction is omitted if 
the item buffer and the first member of the non-composed class do not 
contain the same number of characters, Thus, in the example cited in 
5 . 1 3f4, IFEQKX nou n s will omit a line if the input buffer contains either 
more ur less than three characters. 

5.13 3o The Operation Cede TFLSTH . 

This operation code omits an instruction if the first ASCII character 
in the i t em buffer is not numerically less than the first ASCII character 
of the first member of a given non-composed class. Thus, assuming the 
same membership as in 5.122311, IFLSTH nouns vill omit a line if the 
first ASCII character in the item has an octal value of 150 or more, since 
octal 150 is the value of ASCII h, the first character of hat. 

5.1537 The Operation Code IFGRTH . 

Tills differs from ITLSTH only in that an instruction is omitted if the 
first ASCII character in the input buffer is not numerically greater than 
the first character of the f irst member of a given non-composed class. 

Thus, in the example cited in 5.1336, IFGRTH nouns vill omit a line 
if the first ASCII character in the item buffer has an octal value of 150 
or less. 

5.154 Transfer Conditional on Existence of a Given Non-Composed Class. 

t . 154 1 Th e Op e rati o n 0 o d IFKXST • 

This operation cole omits a line if a given non-composed class is 
not currently stored in core. The class name follows the tab mark. Thus, 
assuming only ‘he classes meu‘ion<d in 5*122311, IFKXST nouns will not 
omit an ins* 2 u:* ion, while IKXST adjs vill. 

5 . 1 > ♦ 2 The 0 pe r at ion C c d e IFISNO . 

This differs from I>EX? only in that the omission of an instruction 
tak^s plate if a given non-comjosed class _i^s currently in core. Thus, 
in the example cited in 5 ■ 1 7 Ji 1 , IFISNO nouns omits an instruction, 
v h i 1 e IFISHO a i j s does n o t . 

5*14 Out pu 1 . 

Cu A j it is always onto a disk file attached to channel *« For this 
reason ve vill. refer *0 the ou‘ pu* file as fil^ 5 . 




Output of Contents of Item Buffer. 



5.141 

5.1411 The Operation Code PUTIT . 

This operation code causes the entire current contents of the item 
buffer to be vritten on file 5* No operands are used. Thus, if after 
the instructions LKLIT abc (cf. 5.11151) and KILFRS 1 (cf. 5.121112) 
the program encounters a PUTIT instruction, the string be vill be vritten 
on file 3* 

5.142 Output of Class Membership. 

5.1421 The Operation Code PUT ALL . 

This operation code causes the program to write on file 3 the 
entire membership of any class--composed or non-composed--vith a carriage 
return following each class member. The class name follows the tab mark. 
Thus, PIET ALL nouns , assuming the same membership as in 5*122311* causes 
the program to write on file 3 the sequence 
hat 
face 
dog 

5.1422 The Operation Code FITTSER . 

This results in the writing on file 3 of the class members corresponding 
to all of the classes in a series generated by repeated SERNAM instructions 
(cf. 5*121212). Each class member is followed by a carriage return. Thus, 
assuming the same classes and class members as in 5*122331* the instruction 
PUTSER label writes on file 3 
xy 
a 

5.143 The Output of Literals. 

5. 1431 The Operation Code PUfLIT . 

This operation code causes a string of characters following the tab 
mark and set off as in 5 *1111211, to be vritten on file 3* Thus, 

FUTL1T ~ abc - writes abc on file 3* 

5.15 Program Delimiters. 

5.151 The Operation Code BBj . 

The first instruction of any EVC program must begin the operation 
code BEG . The tab mark may, but need not* be followed by additional 
material. This material must not extend beyond the first line feed 
following B BJ . 







^ . I 1 ;*^ TV; '3 0 _L> : T at 1 071 Co Je FIN . 

The I ant instruction of any EVC program must begin with th; s 
operation code, tVogrom operation is normally terminated by a GOTO 
instruction (c f. S.ijSll) which leans to the execution of the PIN instruction. 
Material following the tab mark of the PIN instruction is ignored. 

6. Program lor the Conversion of Assertions Concerning 

a Target Language to a Coded Format. 

6 . 0 Jus t i C i oat i on and Purpose . 

6.01 Arch rtypr.l Assertions as a Substitute for a Programming Language. 

One of the major obstacles to the utilization of computational 
techniques by language teachers has been the need to master a programming 
language. Such language no mattei how user-oriented, always entail 
the assimilation 02 row modes of discourse about a familiar subject, a 
task which many land difficult or insurmountable. It would, therefore, 
seem desirable to mike it possible for- language teachers (and other 
scholar..* a-, wll) to interact ►effectively with a computing system "on 
their own ter::/;/ 1 l.t . without the previous acquisition of a syrtom of 
discourse which differs markedly from that which they are accustomed to 
employ. 

It would appear that a satisfactory solution to this problem entails 
an approi'-h to Ur- problem of discourse systems diametrically opposed to 
that inherent in the p r eg r chiming language approach. Instead of forcing 
tb* language teacher to master a new lexicon and grammar, ve must analyze 
the utte: an :es which eharac.tc rizo his own discourse in 1 ho hope of 
deriving therefrom a finite (if, perhaps, large) set of archetypal 
as^rt ior.s, capable of conveying all th*** information which a language 
teacher no ire 11 y wished to crmmunica* . 1: w are fuicrsssful in this 

talk ve can hope to establish efiY c* )V; communist ion between * -he language 
* a. her an i th*: computing .system imply by requiring t.he former to 
l) -VnznlV ,7 all statements ir.t^nh d for computer ecn.'-unp* ion within the 
frame voik of r h c archetypal asm re ion c . included in our r t 1 , and 1 ) supply 
the variables inquir'd to conv*. i" a given archetypal assertion to a 
statwvrt applicable to the language or languages r.e is dealing with. 

(ouch a ctatem*^nf will kvnc‘.:\>i‘ h be called a co ncre te a ssertio n. *1 

Td illustrate -h^ iK-lation of an arehe* vpal assertion let us consider 
4 re * f o 1 1 0 v ; : ;g e ». at-, :v *t r : 
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We submit that each of the above statements is adequately covered 
by the following assertion: The sound represented by Russian stressed Y 
is very similar to the sound represented by the oo of English f toot' . 

This statement may in turn be converted to the archetypal assertion: 
The sound represented by TARGET LANGUAGE VARIABLE is very similar to that 
represented by BASE LANGUAGE VARIABLE. 

It would appear that a language teacher wishing to communicate to a 
computational system the information conveyed by any of our three 
original utterances would l) easily recognize the equivalence of the 
original utterance and our archetypal assertion, and 2) find no difficulty 
in supplying the variables needed for the formulation of a concrete 
assertion. 

6.02 The Value of a Converter Program. 

If a concrete assertion is to serve as a useful tool in the 
generation of programmed instructional material it is necessary to have 
some convenient means of l) distinguishing the archetypal portion of 
the assertion from the variables accompanying it, and 2) retrieving the 
individual variables whenever they may be needed in the generation of 
instructional frames. It follows that it will be useful to convert a 
concrete assertion from its original formulation to 1) a string of 
characters corresponding to the archetypal portion of the instruction 
and constituting an operation code to be utilized in calling computer 
routines used to generate the appropriate instructional frames, and 2) a 
set of literals corresponding to the variables of the concrete assertion 
and each associated with a single label utilized in the frame-generation 



routines 


* 








6.1 


Documentation . 






6.11 


Pune 


ticn of 


Program. 




(Se 


c 6.01 


and 6.02.) 






6.12 


File 
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The 


input 


consists of 
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6.1211 


Use 


of the Tab 


Mark plus a 


Delinitative Character to 



indicate the Beginning of a Concrete Assertion. 



Each concrete assertion must begin with a tab mark. The tab mark 
must be immediately foilcvei by a delinitative character. This may be 
any ASCII character other than the tab mark which does not occur within 
the assertion in a nor-delinitative function. However, it is generally 
rore convenient to use a charac fr r such as a slash or a hyih^n than to 
employ an alphanumeric character. 
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6.1212 



H, - Delimination of Variables within a Concrete Assertion, 

Variables within a concrete assertion are set off by a preceding and 
a following single occurrence of the delimit ative character specified at 
the beginning of Vrr* assertion. Spaces immediately preceding or following 
the delimit at ive character are ignored. 

6.1215 ; of r , h e Se 1 imi * at i ve f : h ara c t e r * . o To rm i na t e a Co nc re t e 

Assertion, 

‘ ir n<j la t non-d'-limitative character of a concrete assertion is 
immediately followed by an uninterrupted sequence of two del imitative 
characters. If the terminal non-del imitative character is part of a 
variable, th>- del imitative character which signals the end of the variable 
also s-*rv^‘ as th- fir:t of th*- two characters used, to terminate the 
a ■ . >‘-rt :c r.. A pe : iol pr* er ding th*. t laminating characters is ignored. 

C.lz'J* Example of Concrete Consti action vi*h Variables*. 

6.121-1 hyph-Tt a 1 * Del imitative Character. 

-'fhs ‘ our.d reprise: ttd by -Russian stressed Y- is very similar io 
* hat represented by -the English vowel in 1 toot 1 

(in this assertion the variables are Russian st ressed Y ar.d the 
Englis h vows . 1 in 1 ; ore. f . ) 

6.12I<j 2 31a r V=t d as Delimit at ivo character. 

/ih.- sound vepie rented by /Russian stressed ?/ is very similar to 
that r**pi vs-entr j by /th** English vowel in 'toot 1 .// 

6.122 Output Vi! . 

It . : oitrul istr> of a disk fii* (chain. -1 t) corfainirg th*- 
i t; 1 4 i a 1 po r tier* of a i \ KV« l prog r am i nc lud i rig one NUT Mil 4 i n s true t i on ( c f . 
t .111 *2c ) foi t act v! the concrete as:- i 4 ions of the input file. 

6.122 1 jjt-1 ini ati\ e uharac'. i . 

M-* dll inina 4 I * rhtra;'. r for th» j NdlM 'Il air:- me 4 ion (si . t.lllj522) 

I ii-Uitisat wi*,h -.hat f:»' * he e-jr.sri-te assertion. 



b.!222 Class Nam* . 

llv- class r.ar.*: i or e ach !« hi M? ! instruction consists of 1 h ^ initial 
le'Vrs of 4 h* vcr.1 which mk- up * he non- variable portion of 4 h-* concrete 
as.*i ! icn co rr*-.- pending to 4 he in * rue* ion. Thus, for + he example cite! 
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6.1224 Example of Output Corresponding to the Input 
Illustrated under 6.1214. 

6.12241 With Delimitative Character as in 6.12141. 

NCIjNIM -tsrbivsttrb-Russian stressed Y-the English vovel 
in ! toot 1 - - 

6.12242 With Delimitative Character as in 6.12142. 

NCLNTM /tsrbivsttrb/Russian stressed Y/the English vovel 
in Hoot 1 // 



6.2 Evaluation and Prospects for Future Development. 

The program documented under 6.1 provides material which could be 
used as input, for more sophisticated EVC programs designed to generate 
instructional frames. Thus, the example cited under 6.1224 could be 
converted to a presentation frame, consisting of the archetypal assertion 
itself, and tvo fill-in frames, one calling for the student to fill in 
the first variable, the other calling for the second variable. 

It would appear that a carefully organized set of archetypal assertion 
would make possible a much more sophisticated set of frame generation 
routines. Thus, if we changed the example cited under 6.1214 to -The 
sound represented by -Russian- stressed -Y- is very similar to that 
represented by the -oo- in the -English- word -toot--, the more discrete 
nature of the variables would permit a much wider variety of reinforcement 
frames. 



7. Conclusions. 

The work carried out during the contract period resulted in the 
development of a number of research tools which, it is ^ o be hoped, will 
prove useful in future work on the automatic generation of materials for 
programmed language instruction. Thus, the programs and notations 
discussed in sections 1 through 3 should prove helpful in the automatic 
generation of a vide variety of phrases and/or ut trances utilizing a 
given vocabulary item. The program discussed in /bet ion 4 facilitates 
the input of information necessary for the successful application of such 
generative techniques to a nev vocabulary. The EVC language discussed 
in section 5 will, it is hoped, greatly increase the speed with which nev 
programs can be written and debugged. Finally, the EVC program discussed 
in section 6 allows language teachers to prepare input for computer 
processing without learning a nev mode of discourse. 

While it is obvious that many problems must be overcome in developing 
automatically generated programmed language- ins* rue tional materials, it 
would seem that we can now proceed to the investigation of suzh problems 
in a much more off icien* manner than was previously possible. 
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